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FfeREWORD 

This volume is a text-4escribing how to apply differential 
-♦equations in panel studies of car e^ -dec is ion making processes. 
Two cqntributions are particularly noteworthy. First, the re- 
port shows how a dynamic mathematical .model can be foprmuJ.ated 
to reflect the important -theoretical idea that career expecta- 
tions gradually evolve in a continuous process over time, s***"- 
ondly,'the report advocates use of the dynamic theoretical 
to\ generate forecasts of career expectations. Such forecasts 
provide much stronger" tests of theory thait, methods currently 
general use'. Procedures for dalculatinokand evaluating the 
- forecasts are descr^ibed in detail in the text. The report is 
one. product of a three-year longitudinal study of developixxg ^ 
career expecliations in which the differential -equation methodoj.- 
- ogy will be applied. 
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ABSTRACT 

This volume contains a theoretical rationale for ar differential 
equation model of the process by- which career expectations of youth 
evolve and presents a detailed explication of the technical infor- 
mation needed to use th6 model. Three im>ortant "feidvantages of the 
differential equation model are reviewed.V Pirstr in'.the differen- 
tial equation model, the dependent variables &re rates of change 
over time in career expectations of youth; hence, theory expressed 
by differential equations automatically accommodates change over 
time. In contrast, most current- models express' static theory in 
which cros&~sec,tional differences among individuals are the sub- 
ject of inquiry. Secondly, the differential equation model express- 
es all causal feedback suggested by theoretical analysis; whereas, 
alternative models frequently neglect causal feedback. Finally, ' 
the differential equation mbdel provides a built-in facility, for ,. 
carrying out empirical tests of theory by evaluation of the ac- 
curacy of forecasts made by the model. «| 

' The technical e^cplication includes: (1) a ifeview of math- 
ematical and statistical material needed to understand the dif- 
ferential equation model i (2) derivation of calculating formulas 
for application of the differential equation model, and (3) de- 
scription 'of a computer program written in conjunction with this 
volume. The computer program is designed to carry put calculations 
needed for application of the differential equation model. 
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CHAPTER 1 

" "NTRODUCTION 



Purpose of Volume 

This volume is writtep in connection with a three-year 
longitudinal study of career expectations of high school stu- 
dents; the study is funded by the National iTistitute of ^duca- 
tiqn. The intent of the longitudinal- study is to contribute to 
scientific understanding of the process by which youth foi;!^ 
.career expectations such as educational, occupational, and in- 
^come expectations. To help achieve this intent , the theory *f ojr 
the study is expresfjed by systems of differential-equations. 
The dif ferential^equation^ methodology achieves three desirable 
outcomes : ^ • 

* 

(a1 The methodology expijesses verbal theory that career 
choice is a continuously evolving process rather than 
an event * 

(b) The methodology facilitates examination of two- 
■ ' directional cause-and-ef feet relationships 

(c) The differential, equation system expresses theory in 
m such a manner that forecasting (prediction) to any^ 

. point along a continuous time scale is an imnfediate 
consequence of' the theory, thus , predictive tests of 
theory are encouraged . ' 

Clearly , the differential-equation methodology holds a much 
broader appeal tnan the application to study of J^areer expecta- 
tions. It presents a venicl.^ for expressing the dynamics of 
natural systems in a manner that is unparallelei^d by standard 
practice in tne application of statistical procedures 'and con- 
tains the essential elements for incorporating predictions (fore 
casting) into the expression and testing of theory. ' Exar.'.ples of 
topics for which thesd features are useful include migration 
and economic growtfi 4^^^ organizational behavior? voting behavior 
and political attitudes, and criminology. In fepite of :-.he po- 
tentially ^de application of the methodological proc - res, 
they have not. been applied very often, primarily becaus of poor 
dissemination to substantive researchers , including those work- 
ing on development of' career expectations. This volume, there- 
fore, is designed to present a clear explication of the 



differential-equation methodology vritten in a style, permitting • 
use- by researchers with moderate amount of technical training. 
The intent is to avoid a purely technical exercise; rather, the 
focus centers on the juxt.aposition ot techniques with substan- 
tive Issues. - ■ ^ 

A number of years ago Robert Merton wrote:* 

This limited account hf.s, at the very least, 
pointed to the need for a closer connection 
between theory and empirical research. The ~ 
prevailing division of the two is manifest 
. in marked discontinuiti^*- of empirical re- 
search, dn the one hand, and- systematic 
theorizing unsustained by empirical test, • 
on the other (Merton,. 1957: 99). ^. 

One goal of this volume is to help reduce the division between 
method and tneory. * * 

Outline of the Volume 

There are six chapters in the volume. Chapter 2 contains a 
theoretical and conceptual discussion of the use of differential 
equations to 'represent career planning processes. Chapter 3 
presents a review of seliected mathematical .and statistical topics 
needed for Chapter 4 and Chapter 5. The intent of Chapter 3 is 
to conununicate the basic- concepts, omitting rigorous proofs. 
Chapter develops the mathematics of differential equations, 
presents justification . for using ordinary least squares in the 
statistic. ^ analysis, and describes a computer program that can 
be used to estimate coefficients of the differential equation 
system. Chapter 5 draws on the technical material presented in 
Cnapters 3 and 4 to describe interpretations of differential- 
equation systems applied to career expectation yariabl-es. Topics 
include comparison of interpretations ' of effects based- on cros.s- 
lagged regressions to interpretations based on differential equa- 
tions, analysis of tne time path of career expectations , includ- 
ing oscillations and equilibr;L)im conditions , development of* • 
standardization methods, for the coef f icients ' of differential 
equations, and presentation of a generalized correlation for 
assessing accuracy pf fotecasts. ThB final chapter summarizes 
the volume.. 



. ,, CHAPTER 2 

CONCSPTUALIZATION AND INTE^RETATION ' 

The purpose Qf this chapter is to introduce the idea that* 
differential equations can -be used to' express some important 
concepts and theory related to development of educational- and 
occupational expectations. The first ""Sfec^ion of the chapter' 
explicates a simplified cros^sectional path model of career ex- 
pectations. The path model serves as a "reference helping' to 
motivate the succeeding discussion of differential equations; 
the features of the differential equation systems are contrasted 
to those of .the" cross- sectional path model . A second section 
presents a brief theoretical rationale for using differential ^ 
equations in research on development of career expectations . 
The third section develops a specific differential equation 
model t9 describe the variables included in, the path model in 
section one. In the fourth section, interpretation of the dif- 
ferential equation system- is discussed. The final section sum- 
marizes th^ chapter. , The chapt^Sr ib not intended to contain a 
thorough technical justification ' for every poi,nt, • Technical 
descriptions are contained in Chapters 4 and 5. 
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A Cross-Sectional Path Model 



Although the study of which this volume is a part addresses 
knotty theoretical and concept ii a 1 issues that cross the boundaries 
of academic *disciplines, the guiding paradigm^ of the research is 
^rawn from status attainment work in the field of sociology, A 
path .model of the processes by which parental statuses are trans- 
mitted to offspring forms the fundamental point of departu'-o. 

.ion (describes a simplified version of such a model which 
can be u\ed as a reference' wlidn^ 'examining a differential-equation 
model of t>h«IKsame variables. To simplify the exposition, the 
example i's le¥s complicated than m?ny path modtels in the pub- 
lished empirical literature related -to status attainment process- 
es (e.g., Hauser, 1972)* Nevertheless , the example does, capture 
the major conceptual and theoretical ideas in the status attain- 
ment literature. . . ' . 

• For illustrative purposes, in developinfcf the model it will 
be helpful to identify a small set 'of variables comifiOnly used in 
status afetairiment research and to classify them into three broad 
categories-, as follows : 



Exogenous variables 



Parental socioeconomic status (x ) 

Mental ability , (xj) 

2 



Intervening variabiles 

1 * ' ' V ' ■ 

' Academic performance ' (y ) 

Significant other educational (y ) 

expectation of -ego 

t 

Signifix:ant other occupational (y ) 

expectation pf ego- 

Eqo*s own educational expectation (y ) 
'n Ego*s own occupational expectation (y^) 

Attainment variables 

Educational attainment {z } 

Occupational attainment (z^) 

For purposes of illustrating the model precise definitions and 
operational procedures for these variables are not" needed, but 
sorae description of the variables and terminology is warranted, 
especially for scholars who are not familiar with status attain- 
ment research. 

/ 

First, note that the variables are^'classif ied into three 
categories: exogenous variables, intervening variables, and 
attainment variables. Exogenous variables refer to background 
characteristics that affect career planning but are not affected 
by any other variables in the model. In this example parental ' 
socioeconomic status and mentTal ability are classified as exo- 
genous variables . Parental socioeconomic status nearly' always 
includes father's educational level and father's occupational 
status. Family income, mother ' s. education and mother's occupa- 
tional status' also are included frequently. In most of the re- 
cent research, these status variables are treated empirically 
as distinct var:j.ables (e.g., Hauser', 1972; Sewell and Hauser, 
li975; Featherman and. Hauser, 1977), but fbr the illustration 
presented here, the simplicity of a single}, aggregate parental- 
status, variable (SES) i's preferable. Mental ability is opera- 
tional ized by use of a/ standardized ability test. In recent 
work it is treated as dependent on' SES rather than as an exo- 
genous variable, but this t^e.atment is difficult to justify and 
does hot alter most substantive conclusions very much. For 



current purposes, it is useful to present the example with two 
exogenous variables. 

The intervening variables refer, to educational and cccupa- 
tional expectations. Academic performance (school grad^js) is 
also included. To avoid confusion with the Freudian cbncqpt of 
"ego" , it is important to note that in t.iC status attainment lit- 
erature the term ego refers to the individual on whom attention 
is centered; the term is used in contrast to the term "signifi- 
cant other" which refers to an individual other than ego who may 
exert some influence on ego's attitudes and/or behavior. For 
example, suppose JacR is a higjt) school sophomore, the level of 
education that he expects to achieve ij-lustrates the variable 
identified as "ego's*' own educational -expectation . The level 
of education that Jack's mother expects him to achieve is an 
laxample of the variable called "significant other educational 
expectation of egd." 
- • 

The variables called attainment variables refer to the so- 
cioeconomic achievement of ego. Achievement should be carefully 
\dif f erentiated from aspi^tion or expectation. 

It should be reiterated that the illustration is not a compre- 
hensive ffipdel. In published research, parental SES is generally 
disaggregated into its components, a more detailed list of 
significant-other variables generalj.y is includ^, and, ihcr'eas- 
ingly, income attainments and, sometimes- iilcome expectations also* 
are studied. The important theoretical, conceptual and method- 
ological features of status-attainment research can bte illustrated 
with this list of variables, however. 

Figure 1 displays a p^th diagram of the basic theory. A' 
straight, single-headed arrow denotes a cause-and-ef f act relation; 
whereas, the curved, double-headed arrows indicate uiianalyzed 
correlations (i.e., nb causal relatioi^ship specifiedK It should 
be noted that the two-directional arrows c6nnecti*hg pairs of inr- 
tervening variables depart from conventional path diagrams.^ Nor- 
mally, these arrows are omitted. They are included here because 
their px^sence more "accurately reflects the state of the theory^, 
than t^eir omission. Omitting the"^ curved arrow connecting, say 
significant other /educa^onal, expectation to significant other 
occupational expectation implies that neiir^er one affedts the 
other. It is more accurate to state that the theory lacks the 
power to specify a one-directional relationship, and since the / 
statistrlcal metnodology demands ."the assumption of ohe-directional( 
effects, the cause and effect relationship generating the corr^la*' 
tion Hetween the two variables is left unanalyzed. 

"tJote especially three features of the model.' First, there 
are no two-directional effects hypothesized. Secondly, the 
model for the intervening variables ignores change over time. 
Finally, the three subsets of variable^ comprise a "block-recursive 
system. That is, exogenous Variables may affect the other two 
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^blocks" of variables but are unaffected by th^m,^ and ^in,tervening 
variables ir.ay affect attainmeAt variabies but pOt affected by 
attainment -variables. (The iittervening'-^variables ngrmally are de 
f ineci over the adolescent yeans. ) \ . 

** • r . ^ ^ ^ 

' As represented in figu/e 1, there^is no dire'ct effect of ^, the 
Gxogejious variables on attainment variables,, because there are no 
arrows leading directly from parental ,SES or mental ability to at 
tainment. -This is an important theoretical feai^ure of the.mo4#l.- 
By hypothesis, intervening variables comprise the mechanisms' by 
which status is transmitted- if rom parent tp child.- In addition,- 
by hypotnesis, educational achiev&nent is the route by; which 
parental SES , mental ability,; and career expectations get trans- 
lated, into occupational status. In simplified form, the raodsl 
may be diagrammed as follows. 



SES 




Mental , 


— > 


Abil^^ty 





Grades 
EKpectations 



Education 



Occupational 
status 



Of cour'se, data seldom support this parsimonious viewpoint in 
eiiery particul-a^^^ ^^it the theoretical model' is approximated fre- 
quently i*i empirical study {e.g., Sew^ll and Hauser, 1^75; 
Rehberg and Rosenthal, 197&; Duncan, Featherman , and Duncan, 
1972; Alexander and Eckland, 1975; Cosby, et al. , 1979). 

ft 

The path diagram 'is a heuristic picture of an equation sys- 
tem. The major hypotheses are expressed by a set of linear equa- 
tions, in thG'following manner - 
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Process Variables 
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Perform^ce 



SOed, ^ 
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Figure 1 . Illustrative Path Model of Status Attainment Processes. 



Notes: SES socioeconomic statu 



so " signifij:ant other 
ed, ? education 



Occ. - occupation 



where tlie 



and . are constants (path coef 



f icients) 
u ■ and 



indexing the' etffictc of variable j on variable 'if the 

V. are unmeasured "disturbance" variables. The variances 
1 • 1 ■ 

of the disturbance variables index the degree to which. dependent 
varjdables are contrall-ed or influenced by the measured variables 
hypothesized to affect them; the^ smaller the variance of a dis- 
turbance variable, the la^rger the influence of the measured 
.variables . 



It is important to. notice that equation system (1) does nq-'c 
correspond exactly to the path diagram in figure 1, First, for 
simplicity the distuj^baAce. variables are omitted from the path 
diagram; this omission marks a mild departure from eonven- 

SecJoi^dly, there are'' more path - coefficients (a*s, b*s, m =»/ 
s*s inAequation system (1)} than there are paths in f ig- 
.pAth 



tion 
^*s and 
ure 1 



r q*s 

in 

theory ; 



The .pAjLh diagram oversimplifies to emphasize the 
wRereas, the ^^qiiat ions include all paths that typically would be 
calculated . in empirical application. An" indication of how well 
the theory fitg can be 'gained by observing how close to zero are 
the calculated paths corresponding to missing arrows in figure 1, 
If-th'e theory. were sufficiently convincing/ the postulated zero 
paths could be afssunifed zero, and the information th^t so^ne paths 
are zero could ^*th^ Ss used to improve estimation of the other 
paths. In the li^^atWe, however, it is generally assumed that 
the theory is ncft >'strong^, 'enough to m^rit setting any paths to 
zero a ' priori . ' 

There are two fiJatures of the model th^t are important ■ in 
the present context. First, there are no t^-directional cause- 
and-e^fect relationshl'pg hypothesized.- This (means that in no 
case "is there a path coefficient included to ^iijiidex the effect of 
v on X th^re is a path coefficient included to 



index the ef- 



X on 



Thus, for example, 



^21 pr^ent in the model; 



feet of 

its presence indicates a hypothesized effect of academic per- 
formance on the level of education that significant others ex- 
pect^ ego to achieve.- On the other |^and, b*2 is not present in 

the model; its absence. indicates an implicit hypothesis that 
ego's academic performance is not influenced by the level of ed- 
ucational expectation held for him/her by significant others 
(such as parents! k * • 

The seconds* important feature the model is its static 
conceptualization of the process variables. Notice that changes 
over time in the process variables ' are- not part of the jnodel . 
The model is about eross-sectior^l differences among individuals 
not, about changes over tin^e tof ' a particular, individual . Al- 
. though this point is ndt i^ew, it does not appear to be widely 
"linderstoodi This distinction between change over '^time and 
cross-sectional differences among individuals forros an 
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impprtanl part of the justification for applying differential 
equations to the study of career planning. In .consequence, the 
next .few paragraphs develop an algebraic treatment of the approp- 
riate interpretation of path coefficients calculated from. cross- 
sect ionai data.. , , - 

The purpose of ^this discussion is to review tlie meaning of 
the, phrase: sfat ist^ical ^control" by linear regression; when 
regression methods are applied to cross-sectional/ data. The 
coRjmentary 'also applies tp- the concept of . direct, effect in path 
analysis. The fundamental idea is to examine thg impact of a 
single independent variable on' a dependent variable while hold- 
ing all other' independent variables "constant." For this exer- 
cise, assum*e that the dependent variable is ego's edi^cational ex- 
pectation (y^) and that one wishes to examine the eJSf^t of- 

signi,ficant others' educational ©K^ctation of ego while 
"controlling" or' "holding constant" the other inde^ndent vari- 
ables affecting ^4 ' ' ' 

To develop this idea, consider two individuals who have^the 
same mental ability, same parental SES, same signif icant-oti^ 
occupational expectation, arid the same level of the "d.isturbaYice" 
variable, but who differ by one unit with respect to significant- 
other educati^onal expedtation. Suppose one wished to discover ' 
to what degree" two such individuals differ in the level .of edu- 
cational cxWcta^tion (y^)- Define the following^-^aboration of 

the notational ^heme: » 

I 

= parental SES for person 1 and person 2, 

respectively ' " ' 

« 

= mental ability for person 1 and person 2, 
respectively 

i 

= academic performance' .(grade average) for person 1 
and person 2 , respec,tively 

= ' signif icant-other educational expectation for 
person 1 and person 2, respectively 

^ , J signif icajit-other occupational expectation' for 

r person 1 and person 2, respectively * 



^11' 


^12 


^21' 


^22 


^11' 


^12 


^21' 


^2 2 


^31' 


^32 




^42 


"41' 


"42 



= ego's own educational expectation for .person 1 
^v.d person 2, respectively 

= disturbance term for person 1 and person 2, 
respectively 



Keeping in mind the conditions just established, inser,t this no- 
*iation into equation- (Id) for person 1 and person 2: 

+ ''43y3x * "41 (person 1) 

^43^32 "42 (P®^^°" 2) 
Subtracting both sides of equation {2b) from equation <2a) yie.lds'i^ 

^41 - ^42 : ^^40 - ^40^ "^'^41 ^^11 " ^12^ ^42^^*21 " ^22^ 

^. ^4L.^yil - ^12^ ■^^42^^21 - ^22^ - 

^43^^31 - ^32^ ^"4^ - ^42' 

since we have constructed the situation so that all independent 
variables are "constant" -except significant-other educational ex- 
pectation s of^ ^go and have stipulated .a unit difference between • ! 
the two person's significant-other educational expectations, the 
following facts hold: ■ 

= 0, 




arefore reduces tp^: 



41 ^42 42 

-I 

In words, ceteris paribus, a unit dif f er*ence b/etween to persons 
on signif ;cant-other educational^ expectation, generates a dif- 
ference in eg9's own 'educational expectation .equal tp the path 
coefficient (bj-,) • More generally, ^if " 21 ^ "^'"^ 

not z^ro, the ratio of^ th^ differenc/ in thr dependent, variable 
to the difference in the independent/ variable, "everything else' 
being constant" is the path cpeff iciest , viz: 

(y41 - y42^/^y21 ~ ^22? " ^42* 

This is the meaning of statistida! "control" by regression 
in cross-sectional data. For the present discussion, it is im- 
portant to note that when path analysis is applied to cross- 
secjtional data, the path 'coefficients index differences between 



ind ividuals , ' not changes over time . The relationship between in- 
dividual differences and change over time is seldoa explicit i^,ee, 
however, Coleman, 1968 and Kout ^nd Morgan^. 1975) . One of the 
virtues pf differential-equation analysis is that it can explicate 
this relationship. 



■ s 



Theoretical Basis for a Differential Equation Model 

Theoretical Background 



• ■ Ti\e, developmental theme of forming career orientations is 
pervasive in the vocational psychology literature. Fc^ example, 
the idea is. repeatedly stated in the WJ^itings of Donald Super, 
who at one point expressed it in the following terms: 

•PROPOSITION l._ .^Vo cational develgpment is an qngoing 
(H ^ntinuous , and generally irreversible process . VO" 
catiOnai preferences and competencies ... change with 
time and experience, making choice and adjustment a 
continuous process (Super, et al . , 1957: 89, emphas- 
^ • is in the original) . 

Similarly, Ginzberg and associates state: 

Our basic assumption was that an individual never 
reaches the ultimate decision at a single moment 
in time, but through a series of decisions over a 
'period of many years; the cumulative impact Ts the 
' determining factor (Ginzberg^ et al. ^ 1951: .27), 

This theme can also be found in the -waitings o;^ 4ffany (bti^er theo^ 
rists (e.g., Pietrofesa and Splete, 1976; Tiedeman, '1961; Rodgers, 
1966; Bl'du, et al . , 1956). After .reviewing several "macro- 
theories," Picou, Curry and Kotchkiss indicated the folloWjLn^ 
general characterization of the theoretical literature. 



f 



The macro-fc^eoretical approaches reyiewed above 
have §ever41 common themes* First, all of the 
above theorists have implicitly or explicitly 
noted the de^^e.lopmental character of occupational 
choice and placem^ent. The problems of career 
choice ar)d attainments are clearly limited to a 
life-cycte framework. Labor market entry and 
oareer patterns tend to be viewed in conjunction 
with indiyidual maturation arid growth (Picou, 
Curry and Hotchkiss, 1976 : 12) . . 

It is undoubtedly obvious to most of the research . community 
that career orientations are formed in a gradual process over 
time, yet operatiomal jM^ocedures commonly found in empirical 
study do not reflect tlVis obvious point. While the theoretical 
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literature has been helpful in.pdinting out the dynamic nature of 
the process, a proposition stated in the general terms' such as 
those used by Super and associates, quoted above, is of little 
use in empirical research. It is imperative that the general 
idea be translated into exact hypotheses. Again quoting Merton, 

Much of what is described in textbooks as^ sociolog- 
ical theory consists of general orientations toward 
substantive materials. Such orientations invblve 
broad postulates which indicate types of variables 
which 4re somehow to be taken into account rather 
than specifying determinant relationships between 
particular variables. • Indispensable though these 
orientations are, they provide only the broadest 
framework for empirical inquiry (Merton, 1957: ^ 
87-*88). . . 

Although most status-attainmept models have been confined to 
one-directional causal systems (for notable exceptions see Hout 
and Mogran, 1976; Woelfel and Haller, 1971; and Nolle^ 1973) there 
is ample reason to expect that several of the key proce^ vari- 
ables in career planning exercise reciprocal effec1;s on each 
other. For example, as described in the preceeding sectiorr in - 
most models of the process, academic performance and significant - 
other variables are assumed to affect' youths ' career orientations; 
ef-fects of youths' c-areer orientations on academic performanqje and 
significant other variables are assum^ zero/ Yet it is' plausible 
that students adjust their academic effort^in response tg their 
educational and occupational ambitions. -<ft is also plausible that 
significant others, in part, adjust their expectations of youth 
to conform to the career orientations that the significant others 
know the youth hold (see Curry, et al., 1976 for a discussion of 
these issues) . Additionally, career orientations probably ex- 
hibit reciprocal effects on each other ;^ for^example, ec^oational 
expectation probably affects occupational expectation, pnd vice 
versa (see Kerckhoff, 1971) . Consequently , the dynamicj model 
must permit reciprocal effects among these variables. 

In the next subsection, simultaneous differential equation 
models are proposed as a general methodology for expressing the 
dynamic, reciprocal 'aspects of forming career or iei^ations . The 
discussion is presented via a" simple example that parallels the 
example of a path. model reviewed in the present section. 

An Example of a Dynamic Model 

Virkually all structural-equation models of status attain- 
ment pro9esses have been linear, and the few available tests of 
linearity assumptions show only minor departures from'' linearity 
(e.g., Gasson, Haller aHd Sewell, 1972; and WiJ.son'and Forties, 
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1975) . Consequently, a first effort to translate current 
models into <iynamic tWQ-d'irectional niodels it Is sensible to ad- 
here to the linear postulate.^ s 



The simXrl^aneous , differential equation inedef assumes con- 
tinuous time and -continuous change in the endogenous (interven- 
ing and attainment) variables. There is l^ittle difficulty in 
conceptualizing the intervening variables ^s changing continu- 
ously; this is a major emphasis in the theoretical literature 
cited abov^. The attainment variablies, on- the other hand, ^an- ■ 
not be so easily conceptualized as changing continuously. Oc- » 
cupat^onal change obviously ppcurs in discrete jumps, and educa- 
tional attainment exhibits similar discontinuous change 
especially at the end of degree progreims. These difficulties 
cannot be resolved in this volume? Hence,- the illustration will' 
be confined to the exogenous variables and the intervening var- 
iables, and specification of a model including attainment var- 
iables will be, left for future work. 

^With the above con'siderations in mind, it i& now plausible 
to formulate in dynamic terms ti^E basic status attainment model'. 
To do this some additional notation is needed. Let the variable 
symbols, X. and y. be identified by the preceding list of yar- 

iabies^ let b. . and c.. be constants over time and over indivi- 

duals/ let dy^(t)/dt repJresent the instantaneous change' in var-- 

iable for a small change in time, and let u^|t),.be a distur-- 

bance term at time t* The. dynamic model can now tSe written in 
the following form: . . 

C4c) dy^(t)/at - 4^ ♦ «3jX^. ♦ ^ ^jiVi^t) ^ ^32^2^^^ * ^33X3(1) ♦ b^^y^ Ct) ♦ b33y5(t) ♦ M^it) 

(<4l) dy^(t)/dt: * ♦ «^^x^ ♦ m^^a^ ♦ ^41^1^^^ * ^42^'2^^^ * ^43^3^^^ * ^44^4^^^ ^ ^45^5^^^ * ^4^'^ 

It is irtiportant to describe how these equations reflect the^high 
ly generalized hypothesis that formation of career expectations 
is a continuous process^ and the somewhat more specific ^rgu-- ^ 
ments tjiat teciprocal effects may be*observed ait\ong the l^oSess 



1. Clearly, this argument must be viewed as an analogy, for, as 
noted in the preceding section oJ^ the text, the cross- 
sectional path coefficients and 'the parameters of the 
differential-equation model are not equivalent. I wish 
to thank Professor Robert Leik for bringirig this point 
to- my attention. 
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variables. The fact that the dependent variables are instarvtan- 
eous change rates {dy£{t)/dt] with respect to time reflects the 

continuous nature of the process. Ob serv^ig aquation {4a), one 
can see that the instantaneous change in/academic p^rformanoe 
at time t [dy, (t)/dt] is a linear combination of the exogenous 

variables, SES (jc^) , and mental ability (x^) * and of. all the 

.current values of the intervening variables [y^ (t) including ^ 

the current value of academic performance itself [y^^Ct)]. The 

second equation (4b) indicates that the change in significant- 
other educational expectation for ego [dy2^(t)/dtl ii^ al*so a 

partial function of the' current values of all the intervening 
variables. Hence, change in acad^ic performance * is affected 
by significant-other expectation pf ego, and change in 
significant-other expectation for ego is af;fected by academic 
performance. This type of reciprocal pattern can be observf*d 
among all the intervening variables. On the other hand, the 
exogenous variables are assumed fixed and unaffected by any in- 
tervening variable. 

Stated in the differential equation form, these hypotheses 
cannpt be tested against data, since instantaneous change rates 
cannot be observed. Consequently, it is necessary to integrate 
the system in order to find ^the relationship between observable 
variables at two "points in time separated by a finite time in- 
terval. The integration procedure is^ described in Chapter 4. 

It should be emphasize<^ that equation system (4) is only 
an example. Technically, it is termed first-order ordinary 
simultaneous linear differential equations with con'stant coef- 
ficients. The term first order means th,at no second or higher 
order derivatives appear in the equations.^ The term "ordinary" 
means that change rates are taken with respect to only one var*- 
iable, time . The t6rm simultaneous means that all of the rela- 
tions must hold at the same time — thus the equations express 
the concept (of a system of variables. The term linear means 
that the algSebraic form of the ^ equations is linear.* Finally, 
the term constant coefficients means that all the b^ ^ are con- 

stant over time." Each of these features forms a part pf the 
implicit assumptions of the model. These assumpt_4-ons are fair- 
ly restrictive, biit it seems r^sonable to start with a simple 
model and add complicating features as empirical, results indi- 
cate. The poin^t is that by suggesting this model, exploration of 
the wealth of available technology for describing dynamic sys- 
tems of career-expectation variables has only begun, , 



2. The concept of first, second, and higher order derivatives 
described in Chapter 3. . 



Interpretation and Dis cussion 

This section is divided into three subsections,. The first 
subsection discusses the manner in which change over time is in- 
corporated into the condeptualization o% the differential-equation 
system, l^he second subsection reviews the use of differential 
equations for* forecasting (prediction) . The concept of equilib- 
rium and oscillation are introdu6ed briefly. The^ final subsection 
discusses the relationship between the change coefficients 
(a^j and b^^) in equation system (4) and the path coefficients 

(a|j and bj^) in equation ^lystems' (1). 

m 

Conceptualization of Change 

Consider the dependent variables in equation system (4); in • 
each specific equation the dependent variable is a rate of 
change, dy^{t)/dt. The numerator, dy.(t), stands for a change 

in the variable y^ over a very brief increment in time. 

dy. (t) = y^(t) - y^ttQ) • • • . ... 

.1-^0 

Where y^(tj^) is the value of y^ ^t time t^ , ^i^^o^ value 
of y^ at time t^, and lim t^^ t^ means in the limiting case as 
t^ and tg are very close to being the same. Similarly, the de- 
nominator, dt, stands for a very small increment in time. 

lim * ■ 

^1 ^0 



Putting thefee two definitions together, one sees that the depen- 
dent variable is the ratio of change in y to change in time, as 
the change in time approaches, biit never quite reaches , zero . 

The hypothesis stated in the example is that this z^ate of 
change is a linear function, of the exogenous variables {"SlIS, MA) 
and the current value of all the intervening variables ^infcluding 
the intervening variable whose change rate is the dependent var- ■ 
iable. As -indicated above, this particular functional foiiin is 
hypothesized by analogy with current path models of the process. 
There are several • obvious generalizations of the example in 
equation system (4). The simplest generalization is to permit 
the coefficients Ca. b. .) to be functions of time, while main- 

•^j ■ / 

taining the linear form. Thus, for example, if a^^. were a 

\ IS. 



positiv>e numjaei^but |;ercied to move to zero ov^ time, then one 
would say that the influence of parental status on yo\ith's-oc- 
cupational status expectation declines over the high school \ 
yearS. ^k^re generally, the change rates car> be 4jypothesized 

to be arbitrary nonlinear functions of the current ' levels oi 
the" variables and. of time. These, generalizations will not bj6'- 
considered further in this volume., •, 

•« 

, r 

, ' . Forecasting , 

Given the hypotheses in equation system- (4) and •observations 
at two points .in tirae for a sample of individuals, it is possible 
to estimate all parameters of forecasting equations. Foreeast:s 
then can be expressed as functions of .time- alone. Procedures for 
determining numerical results are described in chapter 4 . Once 
these forecasting functions are determined,, a predicted value for 
each intervening variably. for each ^individual can be ^g;enerattc" ' 
for any point along- 'a continuous' time scale, prior to collecting 
data for the third ai^ successive panels.. Thes^' prediction's, \ 
then, can be viewed a^ predictions derived fgc^tTtheory of the ^ 
manner in which a system of careej:* planning variables operate 
dynamically over time,. The advantages of the built-in^ prediction 
formula are substantial. ", First, it provides the technique and 
justif icr'.tion for predicti^ tests of the thebretical model of 
status ' -.tainment processes "th^t heretofore -have not been carried 
out. F work with longitudinal data has been cqnf ined« to what 
might be --rmed "post-dictiOn , " that is,' the valiies^of the^.pre-' 
dicte.d -vari^les have been used to help estilrhate the regression 
equations. Since the ^egression coefficients are* chosen , eX post 
f ac± o , 'to maximize the accuracy hf the ^estimates^f '•.the dependent 
variabi^s {in ordinary least squares) ijfc^ is not surprising \p 
find that moderately ■ accurate estima tire can be made. , Projecting 
predictions pt^ior to observation pf ^the predicted variables^ 
could, result in mean-square errors exceed in^^-tiie cross -sectional 
.variance of the de]^endent var-i,ables, but the cross-sectional " 
variance is the maximum -error variance for po stdict ion' using OLS . 
Hence, the predictive test h^ a much strong^ chance -of -failure 
and -thus constitutes a much stronger /test pf the model (sree ^ . 
chapter 5.) • ' , . 

In addition , ^ the contiifiuous-time capability of the pre- 
dictive model permits the measurement in^r.vals to .vary . A 
period of one year might elapse between the first and seconds 
panels and the projections could be made for. one an^ ^ half ^ 
years,- two years, or any, time period beyond the second panel. 
This feature facilitates .coraparifeons between studies u«ing dif- 
ferent measurement intej^yals,- and als6 provides researchers freer 
dom to determine measurement intervals to suit their needs. 



It has been observed that a path analysis based. on longitud- 
inal data could also be. used to generate predictions.',- Certainly 
path analysis (or regression analysis) could b6 used in this way, 
bu^ its applicati'on is, seriously Idmitad^when compared to th^ 
differential equation model. Based on the path - analysis alone," 
any predictions would have to be made over a time iritesrv'al of 
exactly fthe sai.te .length, as the length of tim^ betweerr the mea- 
surements on which the estimates of the pain coefficients were 
calculated. For ' xample, if measurements were baken on high 
school freshman and again on the sanie individuals when* they be- 
came jujii,prs/- there^ is no mechanism built into the theory of 
path anaS^'sdV by itself to permit predib.ting to the senio;r year. 
Preditrtaons would -necessarily be^^to the year following the senior 
year, since estimates of path coefficients were based on a twc 
year interval . Similarly, if the first two measurements w^re 
taken during th^ freshman and sophomore years, predictions woyld 
be to the juhior year. Predictions to the senior- i&ear based on 
path-analysi|^ would be either fb,") ad hoc, or '(b) depend on . 
tending the ^eory of path -analysis ^ It seems unl-ikely thatr 
many researchers would extend p^th-analysis theory in* the required 
manner^ .• Further , the WxHiension would generate a special case o^ 
the forecasting formula derived from differential equations with- 
out the corresponding- conceptual benefits and flexibility of the 
differential equations . ** - , 



Interpretation Of Effects 

The change coefficients,, a. . and b. • in -equation system (4), 

. ij ±j • . ^ ■ -1 

can be usedNtp indicate the effects of. the intervening variables 
on each othbr (b. .) and ef f ects'of -^he exogenous variables on 

t?he intervening variables (a..). For example, a^. indicates the 

'effect of parental SES on the rate of change in occupational- 
status ^pectat-ion of youth. Perhaps of more interest, the b 
coeffic^nts can be used' to assess tfte relative magnitudes of 
two-directidnal effects. TO illustrate, b^2 indicates the in- ' 

stantaneous/ effect of signif icant'^ther educational expectation. 

3. . This point was made in a private dommilnication from Prof;essor 
- Robert 'Leik. * . " 

4'. The extension undoubtedly would fallow a line of reasoning in 
. which the path coefficients- calculated" over , say, a one 
tyear interval were applied successively to pr.edicted 
values, starting one year, after the second measuremeht 
used in calculating th^ path coefficients. A general . ■ 
prediction equation' could *t^en be developed by induction 
. • in a manner paralleling the theory of Markov chairv^ 'oyer 
• . 'discrete t^me intervals.-- s6ch' a developit^ent depends , 

\ however, on a model in which the jsame endogenous variables 
^* are measured at both time points, a pract^Oe not -frequ^nt- 
ly followed ^n*empirical studies.' *^ , ^ 



for ego oh the rate of change in ego's own educational expecta- 
tion; conversely shows the impact of ego^s educational ex- . 

pectation on significant others* educational. expectation of egp. 
V • ^ ; ^■ 

As 'conceptualized here, comparison of effects of one vari-* 
able to effects of a second variable on a given dependent rate * 
of change is awkwaord, be^cause the Independent variables may be " 
measured in scales that are quite different from/ variable to 
variable^ In path analysis th iff problem often ii^ resolved by 
transf orxning all variables to a scale with zero mean auvd unit 
variance, thereby rendering rough comparispns i^etween coefficients 
feasible. The simple strategy Of transforming all variables to 
zero mean and unit varialce when the same variables -are measured 
at more thari one point in time is not advisable however, because 
the strategy artifically removes change over time in the mpaii^and 
variance from the data. Methods of standardization that avoid 
this problem will be developed in Chapter 4. 

Comp arison to Ci oss-Sectional Path Cocjf f icients . In the . 
preceding discussion it was ^ emphasized that cross-sectional p^th 
coefficients measure differences between individuals at a <given 
time point rather than changes over time. Under certain circum- 
stances^ however^ the crosG-'sectional paths (a^^. and b^.) yield 

useful information about the change coef f icientfi^{a^ ^ and ^^j) • 

If the' system of career development variablesMequations (41) has 
reached an equilibrium state, i.e., stopped changing, then the 
cross-sectional path coefficients and the change coefficients 
for a given equation arfe equal, up to a constant of proportion- 
ality . This means that each pat'h coefficient in a given equation 
ca9 be multiplied by the same constant to yield the corresponding 
change coefficient (Coleman, 1968). Unfortunately, the constant 
required to convert from cross-sectional paths to change coef- 
ficients cannot be calculated from cross-sectional data. This 
inter'pretation , of course, depends on specifying the path model 
so that it permits nonzer'o paths everywhere that nonzero change 
coefficients are permitted (e.g., if the differential equation 
model permits nonzero b.- and b . . , the path model must permit 

nonzero b|j and b^^) • It should be emphasized, however, that 

even in equilibrium, comparison of path 'coefficients between equa- 
tions IS not permissible, since different conversion constants 
are required for each equation. For example, comparing b|^ to b^^ 

i's permissible, since these two coefficients appear in the same 
equation, hut comparing b^-^ . to b^^ is not permissible. Compari- 
sons between equations would be appropriate only under the as-- 
sumption that the instantaneous effect of the level of each var- 
iable on its own change rate is 'the same for -all variables ^ 
(b^2^ ~^ ^22 ^ = hjj , where there are^ J equaticJ^s) . This is a 

restri\:tive assumption, so^such comparisons - should generally be 
avoided. 
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\ Comparison to Cross-Lagged Path Coefficients . Suppose twtx 
Observations r for each individual and each endogenous variable 
were available, and a path analysis were executed in 'which the. 
tim^-one observatidns .on the endogenous variables were among the 
independent variables and time-two observations were the depen- 
dent variables. Define all the same endogenous ^variables as in 
the previous examples, and let y^(0) be the observation for y^ 

at the beqinning point, ahd y.{l) the corresponding observation 

at time 1. A Cross-lagged path -analysis might be written as 
follows: . / - 
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(jee, e-.g., Heisc, 1970). Generally, the coefficients a^. and b^. 

ij -^j 

are interpreted as indicators of 'the magriitude"" of effects of one 
variable, on another. But such interpretations of the cross-lagged 
path coefficients calculated over* finite time intervals must be 
made with caution. The magnitudes of the B* coefficients depend 
on the length pf time interval between measur-ejnents ; further, 
the relative magnitudes of different b* coefficients also depend 
on the length of^ the measurement interval. This means that the 
relative effects of two variables on a given process variable 
(b^ . vs b* . , , ',j ^ j') depend on the length of the measurement 

interval, and>the relative magnitude of reciprocal effects de- 
pend on the length of the measurement interval. Even the sign 
of the b* may depend on the. length of the measurement interval.^ 

this observation will be justified in Chapter 4, but it 
should be noted here that the statement depends on the assump- 
tion that the differential equation system really does describe 
.^the process. 



Summary 



. This chapter has developed the substantive rationale for ap- 
plying differential equations to the study of development of 
carreer expectations. The conceptual f rameworK is drawn from 
status-attainment research in sociology, but" important -theoretical 
insi^its from o-fher research traditions were refGrencGd alsSo, and 
included, in the model where possi,ble. 



5. Doreian and Hummon (1974; 1976) make similar observations ^ 
about a single -e'qaation model of status attainment 



processes . 



4 U '% 



The first section of tihe chapter r^viewsi. ,ti>e status 'attain- 
ment theory and presents "^an illustration df a path model bf this, 
theory. The path model shows status background and mental" aBil- , 
ity- affecting career-expectation variables. The care^- 
expectation "variables affect career attainments. The 'path modfel 
of this theory is a set of linear equations ^ describing cros?- - 
sectional differences among individuals rather" than changes .over ' 
time. /'r^— ' ■" ■ ..'^ 

The second section present^s the theoretical basis^ f o?^' trans*-- 
lating the cross-sectional path model into a dynamic model ''pep- 
resented by a syst.em of linear differential equations,. Two points^ 
are made. First, theory suggests that career expectations de- ; v 
velop* gradually over M::ontinuous tipie. Secohdlyi i^iost of the 
career-expectation process variables^ probably exhilsi^it two--way v 
effects on each other. Neither *of these importa^^it features df 
career expectations'^ are expressed byfcross-sectional path models'. 

The third section presents an example of 4 a differential 
equation" model to express the dynamics of developing carreer ex- 
pectations and two-way effects among, career-expectation process 
variables. The differential* equation model repifesents the rate 
of change oyer time in career-expectation variables (e.g., 
status level of occupational expectation) as 'linear functions of 
background variables (family status, mental ability) and of cur- 
rent levels of the career-expectation variables*. ^ Career attain- 
ments are omitted from the model because thev r.hange abruptly 
at isolated time points rather than continuou^Mv over time. 

The fourth section discusses interpretation and appligations 
of the differential-equation model.. It is noted that the model 
refers to change at a point in time rather than over a finite 
interval of time. Use of the model to test theory by forecast- 
ing is emphasiged. It is, argued that accuracy of forecasts * 
comprise ^ stronger test .of theory than the usual post facto ^ 
regression analyses 'in which correlatio;^s ^are used to index the 
accuracy of the model. In addition^ interpretation of the 
change coefficients associated with the differential equation 
model is compared to interpretation of cross-sectipnal and-^ross- 
, lagged path coefficients. It is noted that cross-sectional paths 
generally do not coincide with the change coef f icein^s but do es- 
timate, change coefficients up to a constant of proportionality 
if change in the system has ceased. Perhaps of more interest, 
the cross-lagged path coefficients calculated from two panels of 
observations on the same process variables depend on the length 
of the interv "f time separating the^^twq panels. In contrast, 
thq change co^. ,cients do not depend on the length of the mea- 
'surement interval^ assuming the differential-equation model des- 
cribes the process under study. It is therefore recommended that 
interpretation of cross-lagged path coefficients proceed with 
caution and with the realization that the coefficients do depend 
on the length of the time interval separating panels. The change 
coefficients associated with the differential equation model are an 
alternative set of coefficients that researchers may wish to de- 
pend on for interpretations of effects. 
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. ^ REVIEW ^OF SELECTED^ MATHEMATICAL AND STATISTICAL 

.CONCEPTS AND RELATIONS 

I 

Thorough understanding of' parts of this volume probably 
requires some previous exposure to calculus and td linear alge- 
bra. To reduce the burden on the reader, however, this chapter 
reviews some basic .concept's and relationships in mathematical 
and statistical theory that are used repeatedly in the remain- 
ing chapters. The review is cursory and highly selective. In 
most instances, basic'concepts are sum in an intuitive 

manner, important theorems are illustrated numerically, and the ' 
theorems are stated in brief form without proofs. There are 
four main topics. First, elementary concepts and formulas" in 
calculus are sununarized. Secondly, complex numbers are dis- 
cussed briefly. Thirdly, concepts and formulas in linear alge- 
*bra and matrix equations are covered. This review is somewhat 
jnore extensive than the others because of the strong dependence 
of much of the material in this volume on theorems from linear 
algebra. Finally, st very brief and selective discussion related 
to inferential statist;^ics is presented.^ 

f 

Elementary Concepts and Formulas in Calculus ^ 

^. . > 

There are two major branches to the study of calculus the 

differential/ calculus and the integral calculus. A selective re- 
view of the differential calculus is presented first, then the 
integral calculu"s is reviewed. These reviews are not rigorous 
presentations of the math^atical theory. Rather, they summarize 
very briefly some basic results needed in the remaining chapters^. 



The material in this chapter is common knowledge or readily 
derivable from'co^nmon knowledge in the math^atics or 
statistical 'literature. Numerous reference sources 
were used to assemble the information contaihed in the 
chapter. These include Goodman (1969), Freund (1971), 
Hohn (1972), Yamane (1968), Lancaster (1968), Fisher 
and Ziebur (1958), and Piatt (1971). | Readers desiring 
more rigorous and complete presentations of the theory 
are referred to these sources, among many others that - 
are available. 
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J Differential Calculus 

The dif for eatiad* calculus is {the study of slopes of con- 
tinuqus functions, at isolated points on the curves defined by 
the function^. The^^slope of a straight line defined by 
y = a + bx is familiar; the slope is .the 
change in y as a ratio to the change in x- 

two values, of x, and let y^ and y^^ be the corresponding values 

of y determined by the linear equation, i.e., y^ = a -i-AiX^, and 

y-^ = ^+ bx. 

change in 



"rise over run" or the 
Let "?£q and x^ 'be 



'1' 
y by 



Denote the change in x by Ax = x, - 

y 



a + .^x 

X. 



0' 

and the 



AY =■y^ ' 



0' 



The slope is then defined by 



Xi - Xo 



(a + bxi )■ - (a + 



xi - 



Xo 



bxo ) _ b(xi ~ 
Xi - 



Xq ) 
Xo 



^x 

-AX . b .-, N - 

• AX 

This is a familiar result, illustrated in the previous chapter;' 
the- slope of a straight line is the multiplier constant b in 
the linear equation y = ja + bx • 

For ^a linear equation, this result. holds irrespective of 
the magnitude o£ Ax, but for a curved line, the slope does de- 
pend on the magnitude of ax* To see^ this, observe the parabola 
In figure 2, defined by y = a + b;x^ . 




•3 



,0 
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Figure 2. Slopes .of a Curved Line 

• 

The graph is constructed with, a = .5, and b = •2. Assume for 
example that one wishes to calculate the slope of the line de-- 
fined by contiecting the ^points on the graph corresponding ^to 
X = 1.5, and x = .5. For x, = 1.5, y, = + . 2 (1 . 5) ^ = 

For( Xq = .5 , y^^ = .5 + .2 { . 5) ^ = . 55 . Hence, Ay = y^ - yQ - 
..95 - .5^ = .4.^ Alsp, Ax = ~ ~ ^ ~ ~ ^' '^^^ slope, 
theref^e, is: Ay/Ax = ..4/1 = .4. This slope is represented by 
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s. on the graph (figure 2) . To show that this slope does not 

remain fixed for different anKJunts of change in x, carry out a 
^second set of calculations with x^ = 2, and Xq = .5 One finds: 

y = .5 + .2{2) ^ = l.q 

' ^ ^ = .5 +' .2 ( ^ = .S5 

= y - y =1.3 - .55 * .75 
x = x - X = 2- .5-=1.5 

1 0 \' - . 



= .7 5/1.5 = .-1 



I Thus, slope = Ay/ax = .7 5/1.5 = ."5 f .4 

The following point has-been illustrated: From a given start- 
ing, point, a change of one unit in the independent .variable of a 
nonlinear function generates A different slo^ than does a change 
of one and a half uni^s. 

» It may be useful to contrast this exampll to a parallel set 
•ofi'' figures for^a linear function. Consider the -linear function 
=""a + bx with a = .5 and b = .2. Note that the constants for 
this illustration are the same as for the previous example invol- 
ving a nonlinear function. It was just shown that the slope of 
-the linear function, y = a + bx, is b. '''In the present case, 
slope = b = .2. To illustrate this fact, take/the same points 
for -x which were used to illustrate the behav/or of the non- 
linear function. For x^ =1.5, and Xq = .5; 



= y - y = (.5 + .2(1.5)) - (.5 + .2(.5)) - .2 

Ax =1.5-. 5=1 ' ^ 

slope = Ay/Ax = .2/1 = .2 

For the second p^|iir, of x values used in the nonli*fiear example, 
Xj^ = 2 , and Xq * .5, one has 

^y = y - y = (.5 - .2(2)) - (.5 - .2 (.5)) = .3 

1 0 • ' / 

J 

Ax - 2 - . 5 '= 1 . 5 . ■ ! ^ ' ' 

slope = Ay/A5< = ^.3/1.5 = ^2 

As illustrated, the slope is constant for the linear function and 
equals the multiplier constant, b^= .2. . ; 

V 
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It can also be s6en that for the nonlinear function, the 
^ slope is a function of the starting value of x as well as of i^x. 
Fox example, let Ax = (-.5) - {-1.5} - I. Note that ax « ^ is 
the saroe as for the first example with the parabola. For the • 
first example, however, the slope = Ay/Ax = »4, but a quick cal- 
culation for the present case shows that the slope is -.4. Sim- 
ilarly, for A» = (-,5) - (-2) = 1.5, slope = Ay/Ax = -.75/1.5 = 
-.5. 'In th« second example with the parabola, the change in x 
was also 1.5, but i the slope was a positive one-half rather than 
negative one-half^ 

The differential calcu-lus is the study of how slopes of 
continuous functions depend on x as A^^ goes toward the limib of 
zero. Consider now the function y = a + bx^ algebraically rather 
thar:i graphically. The slope can be written 

A y =A {a -H bx!) - (a -i- bx j ) 

A X * ,A X 

• ■■ ■ V 



= b(xf - x|) 

Xi - Xo 

\ 

= b (xi + Xo ) ^1 - Xq ) 



-H Xo ) 

X^ '- ^ j 



= b (xi^ Xo ) 

In the limit as A x 0, x, -> and this result can be written: 



-A-)- = 2bx = 2bx = 2bx 
AX 0 1 

lira 
A 5c 0 



This is the derivative of y with respect to x for a parabola of 
the form = a + bx. In general, the derivative of y with re- 
spect to X is denoted by dy/dx , As can be seen by the example, 
derivatives do, in general, depend on the value of x* 

The general ^definition of the derivative can be expressed 
as follows. Let y be a continuous function of x. Denote by 
y{a) the value of y when x = a. Now/ by definition 



dy ^ y (x Ax) - y (x) 

dx . . Ax- 
lim 

Ax 0 
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In words, the derivative of a continuous function of x is the 
slope of the function at a specified point on the curve. ' This 
is not a rigotous definition, but it does offer an intuitive 
statement of the raain idea. 

The study of the differential calculus largely is concerned 
with finding algebraic expressions for the derivati.ives of alge- 
braically expressed continuous functions.' Some of the basic 
results needed in the remaining chapters are given here without 
proof. It is assumed that y and z are continuous functions of x, 

1. _d (a + by) ^ d^ 

dx dx 

2. dy^ n-1 dy 
dx ^ dx 

* 

3 . d_(y + z) = d^ ^ 

dx . ..dx .3^. 

^ - ^ g^' y > 0; In y stands for the natural log of y 

5. dv2 ^ dz ^ 2 dy 
dx - dx dx 

y 

^' . = e^' ^ : e is the base of the natural logarithm 

dx dx 

7 . d sin . y _ dy - 

8.* d cos y . dy 
. -dbT"^ = y di 



7". Certainly, not all functions can be expressed algebraically. 

For- example V a string may be spread in a curved fashion 
across a pair of rectangular coordinates. So long as 
no line perpendicular to the ' horizontal axis crosses the 

• . * string twice, the set of points defined by the path of _ 
the string forms a function. If there are no /kinks" in 
the string, it forms a continuous function and hence, is 
dif ferentiable? yet it is unlikely that any algebraic ex 
presiiion describes the path of the spring. ^ It is, per- 
haps, a curious fact that the deriva-tives 6f many func- 
tions can be expressed algebraically ^even though the 
functions have'no ^Igebrajc expression.^ 
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In ail these cases, the fact that y and z are continuous functions 
of ^ is important to a proper interpretation of the formulas. , 
First, illustrate the fonnulas in which z does not appear. As- 
sume the simplest possible function between y and x by setting 
y = X* For this instance dy/dx = 1. Thus, .for example, formula 1 
simplifies to .\ 

4 

(a 4- by) ^ before ^, 

Formula 2 becomes 

dy dx n-1 
dx dx 

Formulas 4, 6, 7, and 8, respectively, simplify to: . 

d In X ^ 1 • ^ ■ . . 

dx X 



d e _ X ,^ 
dx ^ 



"^ d sin X 
dx 

d cos X 



= cos X 



dx 



= -sm X 



To illustrate th.e addition formula 3 and product formula 5, 
l^t y = e^, and z = In x. From formulas 6 and 4, respX^ctively , 
one finds de^/dx = e'^, and d In x/dx = 3-/ x > 0. Hqnce, the ad- 

dition formula specializes to 

_d (y + z) = d (e'' + In x) = e^ + p x > 0 

dx 5x ' • 

Similarly, with these two functions (y, z) , the product formula 
becSjmes ' . ^ • 

, , X , , X d In X , de 
dyz = d(e In x) = e + In x ^ 

dx dx ^ 

^ e ^ + (In x) « ' ^ 

= c^'i- + In X) * ' 



To illustrate a more complifcated case than y = x for the 

lias not involving 
for example, indicates 



2 

formuias not involving z, set' y = ^x ^ In this case, formula i. 
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d.Ja f ^ = b d^x^ « bK 



3 



Attention now shifts to defining higher - order derivatives. 
Consider a simple function and its dej^ivative: 



dy/^x .= (3/6)k^ =Ui/2)x^ 



Let 
can 



(rule 2) 

z = dy/dx = isx^i then dz/djT = Je (by rule 2 again)/, 
be conq^ptu^lized by the . fo^lowin^ notatipn. 

dz ftd jdY^ -A - • 



This 



dx ^dx 



d 




r 



Now, ^incQ dz/dx is the fur^ction tief 
y = (i/6)x^ twice, it is termed the ' 
derivative o^ the second order 



X: comm< 
eraliy, 1 



by differentiating ~ .« 

second derivative of y or the 
common notation. for this is 



d (1/6 xM/dx^ = x. M<ke general^, let f(x) J?e a smooth function 

of X, then its nth^order derivative i^s dertoted by d f(x)/dx J 
It is defined by d'Xf f erentiating the fiyactio^ f (x) n ti^es .y ' 

The Integral Calculus ^ 

There are two types of integrals , — the indefinite integral 
(or antider ivative) , and the definite integralT It is important 
to maintain the conceptual distinction betWfeen the two integrals 
Consider. a continuous function of x, f(x). The indefinite inte- 
gral of f(x) is defined to 6fe a second continuous function, say, 
F(x), whose derivative equals f (x) . The indefinite integral of 
f(x) is .denoted by the symbols /f(x)dx.' Let 

/f(x)dx = F|x) 

th.en by definition 

£(: 

dx 

^ The definite integral is somewhat more difficult to define 
intuitively^ but it can be interpreted as the area under a con- 
tinuous curve from one specific point on thor x axis 'to a second 
specific point on the x axis. For example, *the shaded area in fig^ 
ure 3 depicts the definite integral from point a to point b. 



Figure 3. Area Under A Continuous Curve 

An approximation of an area, under a curved line can be found by 
adding the areas of narrow strips under the curve, as xllustrated 
tT\ - f igure 4- -.- - 




.•Figure 4. Approximations to the Area Under a Smooth Curve 

I 

Note that the area A of each rectangle is giv^n' by' the product of 
its height and width. If its height is y - f(x) ijnd width is the 
change in x - - Ax, then A = 'f(x) Ax. ^Intuitively, it is clear 
that in the limit as Ax goes to zero, the sum of the areas of the 
rectangles converges on the area under the curve. The definite 
integral from a to b of f('X) is denoted by- 

i 

f(x')dx .. , 
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It can be viewed heuristically as the sum of the areas of "narrow 
strips whose heights are f (x) and widths are Ax, in the limit as 
AX 0. ' . ' • ■ 



Now, let Fix) be the ^indefinite integral of f{x), so that 
F(x) = /f(x)dx 

One of the most fascinating and useful facts in mathematics is 
the connection between the indefinite integral F(x) and the 
definite integral, to wit, - 

/f f(x)dx = F(b) - FCa) 

• , r 

This is known as the fundamental theorem of calculus. It says 
that the area under a section of a continuous curve , that is, 
the definite integral ^ can be calculated by finding the dif- 
ference between the indefinite integral evaluated at the end 
■points, a and b/ of the section. 

/ 

The integral ' calculus is the^ basis for translating hypoth- 
eses confcerning instantaneous changes over time into forms that 
can be studied eifipir:ically . Since such translations are the fo- 
cus of the differential-equation model of career expectations, 
it is useful to see how the translation can be achieved. Sup- 
pose one forms a hypothesis about changes in y over time: 

^ - f (y,t) 
dt ^^y^V' 

A derivative cannot be observed sZnce it is "a slope over an in- 
f initcsimally short period . Let \it) be a function of t such 
that F(t) is the indefinite integn^l of f(y,t), and note that - 

- - Y^- Taking the definite integral oh both sides of 
dy = f{y,t)dt, therefore, yields 

■HP 

» 

whore t , t. are two specific time points at which y - y^ aVid 

a 13 ^ a 

y = y^^, respeptjively , The equation involving derivatives has 

been eonverteci into one in which all alem^qts ^re observable. 

One can view t at a fixed initial time poi^K; and let t. vary? 

a ^ 

in these circumstance y is a fixed initial value of y and y. 
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varies >^ith t^. Hence, from th^e ihitial hypothesis concerning ^ 

continuous change, a function giving y in terms of t has been 
derived. The function can be used in conjunction with dat^. 

The above result was derived with the definite integral. 
TUe san^e outcome also can be derived using. the indefinite inte- 
gral. Since it is sometimes more convenient to use the indefi- 
nite" integral .the alternative derivation is presented. Again, 
consider the dxt-ferential equation: dy/dt = f(y,t). 

^ • ' . ; . • ■ 

dy/dt = f {y,t) 

" dy - f Cy , t)dt • . _ • i 

. /dy = /f <y,t)dt • " . . . ' ' 

Suppose that FCy/t) is, a solution to the differential equation," 
so that, by 'd.efinition, dF(y,t)/dt = f (y,t) , If dF(y,t)/dt 
f (y,t) , tihefh %o does dlF(y,t) + c3/dt = fiyvt), where, c is any ^ 
constant ever t. This is an essential aspect of integration; 

sometimes is called the constant of integration. Applying 
this concept, one "finds: ■ . • 

/dy = /f(y,t)dt ' 
y F(y,t) + c 

To check this ^resulfV^ differentiate both sides with respect to t 

dy/dt = dlF(y,t) + c]/dt 

!= dF (y,'t)/dt. + dc/dt 

= f (y,t) + 0 • . ■ 

/ 

since dF(y,t)/dt = f(y,t) (by construction^, and the derivative 
of a constant is always zero. 

For empirical work, the difficulty with the result *that' 
y = F{y,t) + "C is that c is an unknown constant. To find c one 
needs an observation on y at a given time point, say t^. With 

the observation point (t^,y.5, one has 




Puttinq this result into the indefinite integral yields the pre- 
vious conclusion based on the definite integral: 



y * F(t) +• (y^. - F(t^)). 

Sometimes an alternative notaf'^i^n * fbr finding a definite 
integral, ovet a specified interval isfconvenient. In the nota- 
tion -used so far ^ ' ■ 

• * 

t. t. . , 

\J.'dy = {. f {y,t)(?t = F(t^) - F(tj^) 

^ ^ * ^ ' y \ 

Suppose that one desired to denote the initial time point by 0 
{time zero) and the second tlm6 point by • The notation 

}'Qf{y^t)dt is an- awkward notation .tp represent the desired re-- 

si>lt because thp symbol t is used ii\ the same exprisssion to in- 
dicate the end point of the interval and all the points in -be- 
tween . A much clearer notation is : ' - 

, t t 

/dy - /f (y,T)dT = F(t) 7 F(0) 

0 0'' ' 

Now, t denotes the end point of the interval and t indicates 
the .variable which assumes all values in the interval ( 0 ,t ) . 
The variable here denoted by t * is sometimes called a dummy var- 
iable^, but it should not be confused with the same t^x^ used to 
indicate representations of categorical variables regression 
ar^alygis* \ 

A large part of the ^tudy of the* integral Qalculus entails 
finding- algebraic expressions for ^ indef inj.*:e integrals. Several 
of" these results are liste'd without proof below, for reference " 
in later chapters. It is presumed y and z are continuous func- 



of 


X , 








■ / 


bdx = bx + c 




2. 


/ 


" n, . 1 n+I 
X- dx = ^x- -f 


c , n 7^'-l 


3. 


/ 


(y 4- z ) - / ydx 


+ / zdx + c 


4. 


/ 


^ '= In X + c ' 

X 




5. 


/ 


e'^dx = ^ + c 
dy 




6. 


/ 


cos X dx = sin x 




7. 


•-/ 


sin x dx = -COS' 


K + C 



where c is a constant- , ^ 



It should be emphasized that many integrals axist that cannot be 
expressed by an algebraic statement. In fact, even for functions 
given in algebraic terms, . existence of an integral expressable by 
an algebraic statement is the exception .rather than the rule._ • 
When algebraic expression of integrals cannot be found, numerical 
solutions are a viable alternative, by use of electronic computers 



Complex Numbers ^ " 

Some of the results of the section on linear algebra depend 
OR knowledge of complex numbers; hqnce, a very brief summary of 
the important ideas is given hfere. [ The concept of a compl^ex 
number is more meaningful if set in"' the coCtext of other n\!iitll5«r 
sets. The positive integers are discussed first, and the set of 
positive integers is gradually -expanded through the- real numbers, 
and, finally, the complex numbers. , 

Consider ^the number set defined by the positive integers: 

1, 2, " One can- add and multiply two positive .integers 

together and end up with a positive integer for the result. The 
positive integers therefore are said %o be closed in addition and 
multiplication. Subtracting , one positive integer from another 
positive integer does not necessarily yield a positive integer 
as- the result, howeyer . For example, 3 - 5 = -2, or 4 - 4 = 0. 
This difficulty can be overcome by def j'ping an expanded numbej; 
set, to include the positive integers, zero, and negative in'- 
tegers. This new number set is closed for addition, multiplica- 
tion and subtraction, but not for division. For example, 

6 f 2 ^ 3, Dut 5 r 2 = 2.5, which is not an integer. Expanding 
the number set to include rational fractions as well as all 
integers and zero generates a number set which is closed to the 
four arithmetic operations — .^addition, multiplication, subtrac- 
tion, and division. 

Even the rational fractions do not correspond to every point 
on a straight line. This may' be counter-intuitive; to see why it 
is true, consider the point p in figure 5 below. 




Figure 5. Real numbers and points on a line 

T' 
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The positively . sltspin^ line from the. origin to point B and the 
negatively.^sloping line from B to point p are at right angles 
«nd each is one unit in length. The problem is to. find the dis- 
tance froni the' origin to point p. This distance is the lengt h 
of the Vypotenuse^ of a right triangle. By the Pythagorean 
thoprem/ .tijerefore, the square of the length from the origin to 

p 4s 2 = 1 +'1 . The length, £, theref^ir§^ is 1 = /2T But the 
square root of two is .not a rational fraction, as shoim by the 
following arguCTtf'nt. 

2 2 _ 

Since Q = 0, 1 = 1,'and the square of any integer larger 

than i exceeds 2, try 'letting /2~~= a/h, where a/b is a rational 
fraction reduced to its lowest common^ denominator . If a/b is a 
rational fraction reduced to its lowest c'ommon term ,( all rational 
numbers can be sp represented) , then eith.er a or b must be an 
odd integer, because if they were both even, they would be divis- 
able by 2, and therefore, not be reduced to the lowest common 

) ■ 2 2 

term. Since we are assuming /2 = a/b; a = 2b ; hence, a = 2(b//2), 

thus showing that a .is an e\&d^ integer. Since a is an even in- 
teger, one can represent it 'by the product, 2c, where c is any 
integer. One finds 

2 2- 
4c = 2b • 

/" - 2 , 2 ■ 
/ 2c = b 

2 . * . ' 
Therefore, b is also an even integer, implying b is an even in- 
teger. Thus, the presumption" that a/b is .a rational fraction re- 
duced to its lowest common denominator is contradicted. It is 
concluded, therefore, that /2 is not a rational 'number . Yet it 
is a point -on the line, as illustrated in figure 5.",. 

• The set of numbers must be expanded to include some numbers 

other than the integers and rational fractions if all points on ; 

a line are to correspond to a number. The real numbers comprise. 

a set of numbers with a one-to-one correspondence to points jrh a 

line of infinite distance in either direction from th»vorigin. 

Yet all operations on the real numbers do not yield real numbers . 

2 .2 
as answers. For example, the ec^ation i +1 = 0 gives l = -1, - 

but no real number satisfies this relation, since the product of 

any real number with itself is positive. The last expansion of 

the numlspT set is the complex numbers. Complex numbers are writ- 

toi\ in the following form: 

^ X = a + bi , 1 ' . 

whore i '= -1 -is the "imaginary" unit. (The "symbol- = means is. 

defined by.) The real nitmber a is called the "real" part of xj 

and the real number b is termed the "imag' ary" part of^x. Th^ 

complex conjugate of x is denoted by x ana is defined i^yr " 

jT = a - b 1 ^ , . ' ' . 
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All algebraic operations are defined on the set of complex 
numbers in the same way as for real numbers, except where 

i2 is encountered it is replaced with -1. Thus,, addition, sub- 
traction, multiplication and division of x^ = )a-^ + b^i and X2 - 

a2 + b^i are defined as follows. 

+ X2 = + b^i +32"*' ^2^ " ^^1 ^2^ ^^1 ^2^^ 
- X2 = + b^i - a2 - b2i = (a^ - + (b^ - h^)^ 

x^X2 - (a-i ^ b^i){a2 + b2i) = a^a2 + a^b2i + a2b^i - b^b2 

= ^^1^2 " ^1^2^ ^ ^^1^2 ^2^1^^ 

(a^ + b^x) {a,2 - ^2'^) 
X^/X2= (a^ + b^i)/fe2 ^2'^ " U7^rT^Tl~ri7^T^TT^ 



- (a, + b, \ ) (a„ - b„i ) 



q - b^i^ 



=- (a^a2 + b^b^) - ia^h^ - a2b^) i 



2 



aE, + b| , since \ = -1 

Hence one sees that the complex numbers are closed in addition, 
subtraction, multiplication and division, since the results of 
these operations with complex numbers are, themselves, complex 
numbers. . ^ 

A complex number is a pair of real numbers a, b, and can be 
graphed as a point on a plane, as in figure 6. 



If 

/ 
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X = a + fat 



Figure 6. Graphical representation of a complex. number 



/2 2 

The absolute value of x = a + bi is defined by |x| = r = /a + b 
The angle 0 is termed the argument of x. Note ^hat x can be 
written in trigonometric form: 

X = a + bi = /{cos 9"' -f (sin B) i) 

since sin 6 = h/r , and cos ,6 = a/t . ^ 

For multiplication, division, finding- powers and roots, and 
exponentiation, -the trigonometric form is very useful. The fol- 
lowing results follow directly from the , definitions of complex 
numbers and the operations; they are listed without proof. Let 



X, = 



(cos + (sin x^ = r-^ (cos 82 + (sin 82)1)/ and 

X = r(cos 9 +. (sin 6)1), then: 

^'1^2 ^1^2 {sin (6^ + ®2^^^ 

x,/x^ = (r^/r^) {cosCe^ - + (sin (8^ - ^2^^^^ 



= r^{cos (n9 ) + (sin (nS ) i } 



.1/n ^ ^1/n ^^^g ^ ^ 2kir)/n3 + (sin [(6 + 2k-iT)/n]i} 
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where 0 is given in radians and k = 0, 1', .../ n-1 . Ther^are n 

1/n - ■ ^ 

roots to . Exponentiation and natural logs can be written: 

» e^ =. [ cos b + (sin b) \ J , . x *^ a + hi 
' In = Ih r + (6 + 2TTk) I 

♦ • 

where k is, any integer. ' 
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The exponential and logarithm of.comple* numbers enter* xnto sys- 
tems of linear differential equations in an important' way. The 
principal branch of In x is defined by setting k ^y^^ r 

In X = m r + Gi ; if k = 0 ' . ^ ' 

Hereafter , in this volume ^it is-- assumed that the principal branch 
of the logarithiA is to be used. „ 

Elements of Linear Algebra ^ 

There are three aspects of linear algebra and related matrix 
equations which are Reviewed here. Pisst, basic concepts related 
to systems of linear relations are summarized. Coverage includes 
the concepts of a solution to" a system of linear equations, rank 
of a system, and determinants. Secondly, characteristic equations 
are reviewed, including eigenvalues and eigenvectors of asymmetric 
real matrices. Thirdly, matrix func;tions involving 'eigenvalues 
and eigenvectors a?^ /'introduced . # . 



Systems of Linear 'Equations 



Consider t following linear equation in two unknowns, x 
and y. 1 ^ 




tion of Y; 

y - 1/2(3 - x) 
X = 3 - 2y 



The equation does not contain enough information to yield 
a unique value for x and y; any point along the line defined by 
y = 1/2(3 - x) satisfies the equation.- Now, add a second equa- 
tion so that a pair of equations must be satisfied^, r 

I ■ 

X + 2y = 3 ' / 



4- 



4 



2x + 2y = 4 • ^ 



If the first equation is subtracted from the second, thfe result 
is' X = 1. Setting x = 1 in the firSfe eqiiatiorf and solving for 
1;- y gives y = 1. There^ is no other p^ir of values x, y tl^a^ render 
. both equations true at the same time/ hende-,. X =^ 1 and y = 1 con- 
stitutes a " unique solution to the pair of simultaneous linear 
equations. 
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Suppose "^that, forXtije secof?& ec[uVtion, instead of 2x + 2y = 
one had 3x + 6y = 9. The new p^ir of Nequa^ions is 



X + 2y = ^* 
3x + 6y = 9" 



There is no way to solve this pair of equations to obtaiii unique 
values X and y. I*f one attehipts to remove y from the second 
equation by multiplying the firs% by 3 and subtract^in^ the re- 
sult from the second, both -x and y, are eliminated, and so is the 
constant on the right : 



^ 3x + 6y = 9 
~3 Cx ^ 2y = 3) 
0 + 0=0 

The reason for this result is that the second equation is a simple 
transformation of the first; it was obtained by multiplying both 
sides of the first equation by 3 . Hence, the two equations are 
l inearly dependent . • 

Now, return to the t^rrst pair of equations and add a third: 

X + 2y = 3 ^ 
2x + 2y = 4 

X - y = 2 < 

It has already been found that the first two equations imply that 
X = 1 and y = 1. But, if the third equation is subtracted from 

, one concludes that y = 1/3. Substituting y = 1/3 into 
equation gives x = 7/3. So , x = 1 , y = 1 satisfies the 
second equation, and x = 7/3, y = 1/3 satisfies the 
third . The three equations are inconsistent . The third 
equation would be consistent with the other two if it stated 
X - y = 0, or some multiple thereof. 



the first 
the third 
first and 
first and 



It is useful to represent these operations as. matrix equa- 
tions. A matrix is a rectangular table filled with numbers (or 
symbols representing numbers) . If A symbolizes the matrix , then 
a^j represents the entry in the ith row and jth column of A. The 

entries for which the row and column indc^xes are equal are called 
diagonal entries and the diagonal entries comprise the diagonal 
of the matrix. 



For example, if 



1 2 



- I 2 2 
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T^en a,, - 1, = = ^22 = ^"'^^^^ except the dia- 

gonal entries of a matrix arjB zero, the matrix is called a diagonal 
matrix The order of the matrix .Is the number of rows by the 
nS^ber 'of columns. A i^ a 2 x 2 matrix. If the number of columns 
ifone, the mat^rix is a column vector, if the number of rows xs 
one the matrix is a row vector . The transpose of a matrix A is 
defined by the matrix-T5uH3^n rows of A form the columns of_the 
new matrix. The transpose of A is dei^oted by A' or . If A - A , 
A is said to be symmetric. ■ ' 

Matrix addition and subtraction are defined as the sum/ 
difference of the individual elements. For example, C.= A + 
■then c. . = a. . + b^.. Matrix addition and subtraction are not d 
' fined unless^'the o^er of the; two matrices to be added or sub- 
tracted is the same . - • 

Matrix multiplication and division are not defined as. element- 
wise multiplication and division. If C - AB, c^^ f a^^b^j. 
Matrix multiplication is defined to represent, systems of simultan- 
eous linear equations. If C = AB, then, by definition, 

^ij '= ^l^lj ^i2^2j ^ ^ ^iAj ^ . 

where K is the column order of A and row order of B. 

' Note that, in general AB ^ BA. In fact, it is possible that A^ 
i-s defined when BA is not, ilnce the column order of " f ^^J^^ • ^^^^^ 
the row order ofB when the column order of B does not match the 
row order of A. For example, A may be a 2 x 3 matrix, and | a 
3 X 3 matrix." Consequently, it is impprtant to ^^^If^^ 
multiplication or postmultiplicatioh . ^^^^^i^^P^^^^^^^" °f 
A indicates AB, and postmultiplication of B by A means BA.' To 
be conformable for multiplication, the column order of the pre 
multiplier must match the row order of the postmultiplier . 

TO illustrate matrix representation of linear-equation sys- 
tems, reconsider the first pair Of equations among the preceding 
illustrations : 



X 



1 ^ 2x2 = 3 



2x^ + 2X2 = 4 

where x-, replaces the symbol x, and X2 replaces y. 
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-teSiT^e the matrix A and column vectors x and b, as follows 



A = 



1 2 

2 !2 



Now, thh pair of linear equations can be written compactly 



Ax = b 



Suppose one preraultip^ied both sides of this matrix equation by 
the following matrix 

as follows : ^ " 

d-U ! i^) (14) -(M ■ 

Carrying out the indicated operations yields 



1 0 
0 1 



X2 



1 
1 



Thus, premultiplication of the matrix equation Ax = b by t^e 
matrix P yields x^ = 1 , x^ = 1 r as before. P is the inverse of 

A and is written A*""^; it represents the matrix generalization of 
division. Suppose, a, b, arid c are scalars, i.e., single num- 
bers rather than matrices, and ab = c. Then 



\ 



/ 



-1 , -1 

ab = a c 

l-b = a''^c 



b = c/a' 



**rhe matrix inverse is defined in the same manner. ; If 
AB = C {A,B, and C comformable matrices for the indicated opera- 
tions), then a'"^' is ''defined such that B = a"^C. Let\a special 



inatrix be defined for which each diagonal entry is 1.0 and the 
remaining entries ^re zeroj this is called the identity matrix 

and generally is represented by the letter I. For B « A C to 

beV solution to AB « C, one must have a"^A « I. Hence, this 
relStion defin^.s tHe matrix inverse. TEe matrix inverse is de-- 
fined only for square matrices but may not always exist even^for 

square, matrio-§. / If a"^ exists, then AA"-^ « A~ A = I, and A 
is unique'. 

To gain some insight into the relationship between the ex- 
istence of tl.e matrix inverse and solvability of systems of 
linear e<^uat4ons, reconsider the second pair of I 'near equations 
in the a^ve illustrations: 

■ ^ X, + 2Xj = 3 

3x^ + = 9 

•Recall that r.o unique solution for x^ and X2 could be found for 
this pair o^ equations (numerous solutions exist) . Now rewrite 
the pair it matri^ notation. 

'ox P 



^ * -1 ..... 

It can bf $hown that for this system Q does not exist; findj-ng 
is like finding" the. reciprocal of sero. 

To see this, define the determinant of a 2 x 2 matrix, 
A, by lAl = a^^a22 - ^'12^21' ^^^"^^ i-l matrix de- 

terminant'. It ca'n be shown that the inverse of A for A of 
order 2x2 can be found as follows. 

Clearly,' the inverse of a 2 x 2 matrix exists if and^only if 
|aI i- 0. Calculate next, the determinant of Q = jQ| - 

a a - q q • = 1 x 6 - 2 x'3 = 0. Thus, q'"^ does not exist, 
^11^12 12^21 ^ ^ — 

ref lecting^the fact that the corresponding pair of equations 
does not have a unique solution. 

Mnl accurate definition of the determinant of a general matrix 
requiTBfe more space than can be justified , here . The mam results 

7 • 
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nfeeded for later chapters are: (a), "^he matrix inverse of A exists 
and is unique if and only if [a] ^ OAand (b) a system of simul- 
taneous linear equations has a unique Solution if and .only if the 
determinant of the associated coefficient matrix. A, is not zero - 
JaI ^ 0. These are two ways to say the «ame thing. 

If in the system of equations Ax = .b\ the row order of A is 
less than the column order, the system is vxnderidentif ied ^ — any 
number of vectors x can be found such that fpc = b. If A is • 
square and |a| = 0, then Ax = b also is undeFiderltif ied . If the 
row order of A exceeds the col,umn order , the Equation, system 
Ax = b is over identified , meaning in general, no x can be found 
to satisfy the'relation Ax = b. Each of these cases has been 
illustrated above. \ 

The rank of a matrix is defined as the maximum number of 
rows or columns that are not linearly dependent. For a square 
matrix, if its rank equals its order it is termed full rank. If 
it is not full rank it is called singular . For example 



A = 



is rank 2; its rows and columns arer linearly independent; hence, 
it is full ra-nk. In contrast, the rows (columns) of Q are lin- 
early dependent: 




0 = 




because row 2=3^ (row 1^: 

i{l 2) = t3 6) 
Also , column 2 = 2x (column 1) : 



2x 



'1] 


t-i 




l3i 







This can be written 



-1 



This il'lustratGs^thc geneifal definition of linear dependence, 
columns of A are defined to be linearly dependent if 



Tlie 



Tbc = 0 



J 



i 
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with not ail x zero. In the example. 



note that for a scalar k 
2 kx, k ?«^0 

is also a nonzero solution , to Ax = 0. This illustrates the gen- 
eral fact that if a nonzero (often termed nontriviai) solution 
to Ax = 0 exists, it is determined only to a constant of propor- 
tionality ,<rk. Note that x and z =» kx are linearly dependent 
solutions, since kx - z = 0. An important theorem in linear 
algebra states that a nontriviai solution to Ax « 0 exists if 
and only^if jA] = 0 (for square A). For future reference it is 
noted that Ax' = 0 is a set of homogeneous 1 inear equations , and 
Ax = b', b ?^ 0 is""a set of nonhomogeneous line ar equations . 



Characteristic Equations 

g Few facts alsout matrices find broader application in the 
social sciences than the theory of characteristic equations. 
Factor analyses, canonical correlation and certain estimation 
methods in econometrics depended, ^n characteifistic-equation 
theory. As shown later in the^^pfesent volume , solutions to lin- 
ear differential equation syst^s with constant coefficients also 
depend on* characteristic equations 

One of the fascinating facts about matrices is that some 
constant X can be subtracted from each diagonal entry, and the 
determinant of the resulting matrix is zero, even if the orig- 
inal matrix A is full rank viz, |A|y 0. Consider as an example, 
the matrix A~presented earlier. It is desired to find a constant 
•X that can be subtracted from each diagonal element of A so that 
the resulting matrix is singular . Now XI is a matrix whose 
diagonal entries are X and off diagonals are zero; hence, A - XI 
is a matrix whose diagonaj. elements are a^j - X and the remaining 

elements are a . . 

13 



For A T ^I to be singular, jA - XI [ = 0. 



Written out, this becomes 

\ 



1 - X, #1 2 

- 2, 2 - X 



= 0 



|a - xi| = 

o/= [a - Xl| = (a^^ - A) (3^2 



- X) - 



^12^21 
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2 

- X - (a^^^ + 322) X + ^^1^22 - ^12*21 
0 = X^ - 3X - 2 (for the numerical' example) • ^ 

which is a quadratic equation and has two solutions: ^X^^ = 3.5616, 
X2 «=-.5616.'* Substituting these for X in the deterroinental equa- 
tion |a -XI[ =0 shows the result to be zero. The Xs are called 
eigenvalue s > characteristic values, or characteristic roots of A. 
The equation |A - XI | = 6 is termed the characteristic equation 
of A. "■ \ ■ ' 

It was seen that th^ characteristic equation of a 2 x 2 
matrix is a second degree po-lynomial . 'This instance generalizes. 
The characteristic equation of an n x n matrix is a polynomial 
of degree n. There are n roots X of the equation, but some of 
the roots may equal other roots.. There are at most n distinct 
roots. • • 'v 

Consider the previously defined matrix Q as a second example 
of a characteristic equation . It is desired to find X ^such that . 
Iq - XI I = 0 . One has 



1 - 
(1 


X, 

3, 6 - 
- X) {6 ' 


.1 - ° 

- X) - 6 = 0^ 


x^ 


^ 7X -1- 


(6 - 6) = 0 


x^ 


- 7X = 


0 


X - 


7 = 0, 


X ^ 0 



^1 = 



X, = 0 

4.. 



Checking, one finds (1 - X^) (6 - X^) - 6 = (1 - 7) (6 - 7) 
- 6 = -6(-l) -6 = 0, and {1 - X2) (6 - X^) - 6 = (1 - 0) (6 - 0) - 
6=0. This illustration is a special case of an important theo- 
rem: the rank of a matrix equals the number of nonzero eigen- 
values. Also, the determinent of a matrix can be found by the 
running product of its eigenvalues: 

|a| = 3.5616(-.5616) = -2 (discrepancy due to rounding) 

= 7x0 = 0 , 

8^ Throughout the text matrix elenents involving some operation 
such as 1 - X are separated from other elements by com- 
mas, but single numbers or characters are. not separated 
/ by commas . * ^ 



Clearly, if one or more Xs is zero, the matrix is singular, i 
Consider a further example. Let 



Now.,, the characteristic polynomial is: 

B - XI 1 =0 



{2-X)(2-X) +1=0 



(2 1 = 0 



f There is no 



since (2 - X) 



real 
2 



number that~^s a solution to this equation, 
= -1, and the square of a real number, is never 



negative . 
Jience,. X, 



Evidently, (2 - X) = ± i , where, as before i^H -1 

= 2 + I , aiid X ~ = 2 - I ., 



This last example illustrates an important fact: ■■ the char- 
acteristic roots (eigenvalues) of a real matrix may be complex. 
Howevf^r, if the real matrix is symme^tric, as is A in the example, 
the -eigenvalues are all real — if the matrix is real, it has 
complex roots only if it 'is asymmetric . It is not always, true 
that every or even any' of the eigenvalues of an asymmetric real 
matrix are complex, however . * Another important theorem has also 



been illustrated : If X = 
its complex conjugate X = 



R 
X. 



■ X^x 

-I 



is a root of A, then so is 



Since 
tionsf -(A - 
solution . 
tic ^vector 
so d6es (A 



;x - xi| =0, the homogeneous linear system of eqiia- 
XT)x =~0 (X a conformable vector) has a nontriviaX^^ , 
The vector x is called the eigenvector; of character! s- 
of A- associated with X . As before-, if {A - XI )x = 0, 
- Xl)z , with' z .= kx/ (k a scalar) . If the eigenvalues 
of A_ are distinct ,^vfneaning no two Xs have the same value, then 
there i^ a characteristic vector associated with each X and the 
set of all chasfecteristic vectors is linearly independent . Let 
A be a diagonal matrix with diagonal elements set equal to the . 
eigenvalues -of A — Xs • (and of fdiagonal elements, zero) , and let 
X be a matrix whose columns form the eigenvectors of A associated 



with _A . That is X 
vector of A a-ssociated 



■2' 
with X 



etc 



in', 
and 



where x; is 



"th^ 



eigea- 



■ • • 1 - ' ' ' ' 

where A is squaie v^ith order n. now, if (A = 0, all 

j « 1, . . . , 'n, SO that Ax ^^ = ^^^y then , 

AX = XA_ * 

Since the eigenvectors are independent, X ^ exists, and one finds •, 
-1 ^ 

X AX = A ■ ' • • • . • 

A can be transformed to a diagonal matrix by the given formula. 
In this cage, A is said to be diagonalizable . One also .has 

.r ■ ■ ■ 

A = XAX 

* * 

This rielation fprms the basis fpr some important results in linear 
dif ferenti«al equation systems. It should be noted that, if some 
of th,e roots (Xs) of A are repeated (e.g., X2 = X^) , A may not , 

diagonalizable, although it may, sometiimes be diagonal^ lable ev^n 
if the Xs are not distinct. . Most matrices in empirical work will 
have n distinct roots and, henpe, be diagonali2ab3 e . 

If A is a real symmetric matrix, and' AX = XA_, then the 
transpose relationship also holds: X'A' ^HT'X'? X'A = AX*, since 

~ -1 

A' = A. If A is not symmetric, then AX=XA-^KA = Ax . Thus 

x""^ plays the same role for asymmetric A that X' does for symmetric 
A. One may calculate the solutions to the equation ^' (A-yl^) = 0. 
where y-* is a,l x n row vector and |a - yl| = 0. It turns out 
that the ys are the same as the Xs. Formrng a matrix, whose columns 
consist of ys associated with X s, one has Y - [ ,T . . . , y;^) . 

It follows that Y'a' = AY'. Y is called the matrijc of left eigen- 
vectors of A a-nd^X is tKe matrix of right eigenvectors .' If A is 

symmetric, Y - X? otherwise, Y' = X . Hence , .the distinction 
between right ahd left eigenvectbrfj is necessary only if A is 
asymmetric, . . 

Recall that if A has a domplex root, X = X„ r X_ i ,* then the 

— R , i 

complex conjugate of X is also an eigenvalue of A — X = X^^ - X ^ \ . 

Let X be the right eigenvector associated with complex X, the^ 
X = x_ + XI is also complex, and its complex conjugate • 



X - - x^i is the eigenvector associated with X. 

To illustrate these facts A, X andX are calculated for 



45 




A = 



1 
2 



2 
2 



Q = 



1 
3 



B = 



2 
-1 



all of which have been used in previous examples. 
A, 



For the matrix 



3.5616 0 
0 -.5616 



1.0 -L.28a8 
1.2808 1.0 



3787 .4851\ 
4851. .3787) 



Checking the calculations, one finds 



,-1 



A = 



1.0 -1.28081 
1.2808 1.0 J 



3.5616 0 

0 -.5616 



. 3787 
.4851 



.4851 
.3787 



2 2/ 



For the matrix Q, one finds 

) 



A = 



-1 



wi tl'i 



7 
i 0 



0 
0 



X = 



1 

3 

1/7 
-3/7 



-2 

i 



2/7 
1/7 



Q - 



X 


A 


1 ^-2' 


(7 0 1 




t3 -1 j 


1o 0 j 





r/7 2/7 
i-3/7 1/7 



Q 

1 2 
3 6 
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For one finds 



X 



2 4- I , Q . 
0 , 2 - 1 I 

' 1 + I 1 -a t 



- Ud + ^ ) ' -^4(1 - t ) 



\ 



Checking thlii^ calciilations , again one finds: 

MP 0 ' 



-1 



B =i 



X A X 

1 + ^ , 1 - I \(2 + 1^ 0 Vhil - O, -hil + 1)1 

-(1 - I), -(1 4- i)f\ 0 2 - \j\hil + 1), -5s(i - I) 

-12 



Note that column 2 of X is the complex conjugate of column 1, just 



as ^2 ~ ^ I' 



Matrix Functions 



One of the most remarkable aspects of diagonalizable matrices 
'is that interesting matrix functions may be defined, by use of the 
eigenvalues and eigenvectors. Consider an elementary eV:ample. In 
the preceding section the inverse of the matrix: 



was found to be: 



-1 1 
.1 -h 



and the eiqcnvaluos and eigenvectors calculated te^^be 

( 3.5616 0 \ y ^ / 1 . 0 -1.2808 ^ 

A \ 0 -.5616 I ' - I 1.2 8 08 1.0 

-1 / .3787 .4851 \ 



X 



-.4851 .3787 
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t5ow, the inverse of A aan be calculated by the following formula: 
-1 




-1 



XA X 



1.0 

1 .2808 



-1 

1 



2808 
0 



IC 



/3 
0 



5616 




.3787 
-.4851 



.4851 
. 3787 




The proof that this is a general result is simple and instructive. 
Let A be a nonsingular, diagonalizable matrix, so that A = XAx ~ , 
where A and X *are the characteristic roots and vectors of A, as 
defined previously. If A is full rank (noAsingular ) then , it has 
been noted, all \ the eigenvalues of A are rfonzero; hence A_-l exists. 



-1 -1 

Let B = XA X , then 



Ab = {XAX {XA"-^X~-^') 
^ X.^A'-'-x"-^ 



,1 



I 



AB I 



Hence, B must be the inverse of A 



The square? root of a matrix may be defined in an analogous 



way . Let A 



h _ 



XA^X with A- 0 (meaning every element of A is 



nonnegative) . Then A^A^ =^CXA^ ^) (xA^x"^) = XA^A^X ^ = XAx"^ = A 
This idea can be generalized to any function that can be repre- 
sented by a series expansion. Examples include the trigonometric 
functions, logarithm and exponential functions. The matrix ex- 
ponential a«^,log functions are two important .» special cases, as 
they play Zfi^import ant role in the next, chapter . They can be 
defined by 



^ A -1 
Xe-X 



In A - X (In A)x' 



Using series expansion of e'^ j and In X., it can be found that 
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) 



In A = 




Obviously, e^" ~ = ln(e-J = A. 



Consider a matrix A(t) whose elements are a functibh oft. 
If A(t) is diagonalizabxe such that the characteristic vectors, 
X, are the same for all t,^then the derivative and integral of 
A(t) with respect to (t) can be written in terms of eigenvalues 
and eigenvectors. An example; of particular importance in the 

At 

next chapter is the integral fe— dt. For a sense of closure 

s . At 
and general interest, both the derivative and integral of e~ 

are given: 



dt 



At -1 -1 
' = (Xe- X ) (XAX )^ 

At* ^ At . At, . At 

= e- A = Ae— , since e~ A = Ae— 



Also , 



fe^^dt = X(/e~^dt)x"'^ = Xe-^'^-^x"-'-, |a| 0 

At -1 . -1 -1 
= (Xe~ K ) (XA "^X ^) 



L 



At. -1 , -1 At \ \ J. n 

e~ A = A e- , |A| f 0 
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statistical fi^ncepts ' ^- ' 

It is assu6ied that the reader is familiar wi^h moait bf t^e'. 
important cortcepts in statistics •needed for this ^lume, in- 
cluding, in particular, the popul^tion/saraple mea*fl variance, 
covariance, corrsi^tion, ^east^^^'ares regression, and signifi-^ 
cance tests for statistics such as these." There are some funda- 
mental ideas and concepts in inferential statistics, however, 
that appear to require some explication. In particular, these 
incl\ude the concepts of expected value of a sample statiAic, 
its variance,, bias ,^ and consistency. Th^ "following paragraphs ' 
\ attempt to describe these concepts so-^tirat their meaning will be 
clear when they ar/€^sed in the next chapter. 



"■The distinctio7\s between sample aistribi;Ubion , population 
distribution and sampling distributionl.a^e -critical to thorough 
understanding of elementary * concepts -xn infeiiential statistics. 
Suppose one has a population bfr universe defined by ' the set of 
all case's that a research question addressjes. Assume that for' 
each case in the population it is theoretically possible to ob- 
serve a value* on a variable called* xl Each value of x has a 
probability associated with it, p^^ giving the relativ^ frequency 

of that value of x in the population'.^ The set of all pairs 
{x,p^) is the population distribution of x. The sample distribb- 

tion of x refers to the relative frequencies associated with ee^cH' 
value (or range of values) of x observed' in the particular samplfe 
The term sampling distribution generally is not applied to^ x at 
all. Rather, it is applied to some function of the sample x s, 
such as the sa^iple mean, variance, or median. Take the salnple 
mean as an example, and consider calculating the samplf; mean 
from a very large number of samples, each of size n. The dis- 
^4 tribution of these means o^er all possible samples is termed the 

sampling distribution of the mean. One bf the most remarlcable 
theorems in statistics is that, no matter what the population 
distribution of x, as th6 sample size gets large , the sampling 
distribution of the mean tends to the normal distribution. This 
fact forms part of the central limit theorem. 

The expected value of a sample statistic is defined as the 
mean of that statistic calculated over all possible samples. 
Clearly it is desirable that the expected value ot a sample sta- 
tistic equal the value of the popul^^.tion parameter that it is in- 
tended to estimate. The standard error of a sample statistic is 
defined"^ as the standard deviation of that statistic- calculated 



9. This IS not a precise definition, but is sufficient for present 
purposes. The concept of probability ' element* needed for 
continuous density functions would^complicate" unduly the 
presentation. • ^ 
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CHAPTER 4 



MATHEMATICAL AND STATISTICAL THEORy.IN THE 
APPLIdATI0^5 OF LINEAR DIFFERENTIAL 
EQUATIONS 

i 

This chapter is divided into three main sections. The 
firslj^ section contains a brief discussion of general differen- 
tial ♦equation sy.t« vcik. The purpose of this discussion is to 
show how the pai 'cular application of linear systems with con- 
stant coefficients fits with the general theory, ^nd thereby, 
to communicate an idea of the untapped potential of differential- 
equation systems for representing socjLal processes. Section two 
reviews the mathematical and statistical theory necessary to con- 
vert a model written as differential equations into a form suit- 
able for empirical study. The third section describes a computer 
progjram package"^ that ca^v^be xie^ to carry out calculations needed 
to estimate parameters o^ the- differential-equation model. 

General^ Dif f eriential-Equation Systems 

^IN. In order to gain somte understanding of general differential 
equation systems it is useful to consider specific cases first 
and work 'up gradually to the general case. A very simple case 
arises "for the continuous population-growth model. Its dif- 
ferential equation is of the fpllowi^ng form. 

(5a) dy/dt = by , v 

Where y is population size, b is the constant rate of popula- 
tion growth, and, as before, t is time. It is useful to sum- 
marize the meaning of this simple differential equation, be- 
cause in most important respects the meaning generalize'^s to . 
more complicated differential equations. One might ..view the 
lefthand side (dy/dt) as the < "dependent variable." The depen- 
dent variable, then, is the rate of change in population size 
(y) at a given instant in time. The righthand side of the 
equation may be interpreted as a hypothesis about the manner 
in which the rate of change in population size occurs. In this 
simple example, it is hypothesized that rate of change in popu- 
lation size is a constant number (bT multiplied by the current 
population size (y) . More generally, the righthand side will 
contain the current valde of y, time, and any number of ad-- 
ditional independent variables. ^ The functional form of the 
right side may vary depending on the substantive application. 

Integrating' equatioi^ (5a) generates the familiar exponen- 
tial- pdpuidtion growth curve, as follows: 

(5b) yj- y^^^^^ 



whero t is a specific point in time, and y^, are population 

size at time t and time 0, respectively. Equation (5b) also 
describes the accumulation of principal on an investment With 
a constant rate of interest (return) of b and continuous com- 
, pounding of the intereSTTs " . 

Equation (5a) is a single linear differential equatitpn. It 
becomes nonlinear if the functdon of y on the right aide is non- 
linear. For example, 

(6) dy/dt = b sin y 

is a single, nonlinear differential equation. ^ 

If there is more than one equation, then one has a system 
of simultaneous differential equations. For example, the fol- 
lowing Bpir of linear differential equations form a special 
case of the model of career expectations given in equations (4). 

(7a) dy^/dt = b^^^y^^ + ^^2^2 
(7b) dy2/dt = ^21^1 ^22^2 

where y^^, y2 variables, the b^^ are constants, anS^ t^ is 

time. Since there is no constant intercept in equations (7) , 
they form a pair of homogeneous linear differential equations. 
Adding an intercept makes the system nonhomogeneous and parallel 
in every technical respect to the dynamic model of career ex- 
pcrtations in equations (4). Equations (8) show an example of 
linear nonhomogeneous system composed of two equations. 

(8a) dy-^/dt = + ^11^1 ^12^2 
(8b) dy2/dt = ^2 + ^21^1 ^22^2 

where and a2 are constant intercepts ,■ and the other notations 

aro defined as in (7). * 

Equations (7). and (8) are linear systems with constant coj^ 
N efficients.' More generally, the coefficients of linear systems 

i^Ki7"bc~t&en as functions of time. For example, suppose that y^ 

represents ego's level of educational expectation and y2 is 
sigi^.if icant other's level of educational .expectation for ego. 
The idea that the influence of the significant other on ego s 
educational expectation level docTines with time could be ex- 
pressed by mailing b-j^2 ^ declining function of time, say b^2 " ^ 



q a positive constant. 



r 
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So far, all* the examples have been first - order - differential 
equations, because only th6 first .derivative appears in them. 
Derivatives of any order may -appear in a dif fe];jBnt^al equation. • 
The order of the hif^hest derivative in a given^^ij^ferential 
equation defines the order of 'the equation.^ . Tfte^ order of a sys- 
tejn of differential equations i^ defined by the sum of the orders* 
of the equations forming the system. Me.aningful examples of 
second and hinher order differential equations involving social 
or psychological variablca are scarce, but there are numerous 
physical e>:amj51es. Probably the "most common l^hysical example 
rel4.tes to distance traveled as a function of time. Speed is 
the first derivative and acceleration is the second derivative 
of distance wsith respect to time. For Example, if y is distance 
and t is timc/^onc might posit that acceleration of a vehicle 
is a- positive but declining function of time, say 

(9a) c^^v/dt" = ce~^^ ■ 

whore c and d are positive' constants . \^ith this equation one .t 
may find the distance traveled from a standing start as a 
function of time; as follows 

(9b) y = (c/b)t + (c/b*^) (e"^^ - 1.) 

where y^ is distance traveled from a standing start after t 

minutes have elapsed. Thus, the single second order differen- 
tial equation (9a) can be operated on to produce a prediction 
about distance traveled from a standing start at every point 
in time. These predictions could be compa'red to data to see 
if the acceleration hypothesis is correct. 

Although social science definitions are seldom given as 
second derivatives, there are some concepts that might be de- 
fined fruitfully in this manner. For example, one might define • 
learning speed as the change in information divided by the 
change in time . At an instant in time, then , learning could 
be defined as the derivative of information with rospect to 
time. A change in learning speed therefore , would be th6 
second derivative of information with respect to time. Thus, 
the concept of accelerated learning could be given a pregise 
definition quite analogous to the physical concept of accelera- 
tion. Similarly, "vertical" occupational mobility could be' 
defined as the derivative of occupational status with respect 
to time, BO that change in mobility rate would be a second 
derivative. A hypothesis about changing rate of mobility, 
analogous to the hypothesis about acceleration in equation {9a) , 
could be used to generate predictidns about occupational status 
level at any point in time. ^ 

In all the examples, derivatives have been with respect 
to tirae alone. Whenever differential equations involve deri- 
vatives with respect to only one variable they are termed 

s 
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ordinary differential equations. If derivatives with respect 
to mo^e than one variable are included one has partial dif- 
fereritiai equations't 

A very general first-order, ordinary differential equa- 
tion system can be written as follows: 

dy^/dt = ^i^Vi' y-^' • ' •' ^k' 
(10) dy^/dt = f2^yi' ^2' ' " ^k' 

dyj^/dt = fj,(y^j^2' • • ^' ^k'"'^^^ 



where y^., y^, . . . , Vj^ comprise a set of k variables, t is 
.time/ and are ^ny continuous functions of the ys and of t. 

' ' More general equation systems involving higher order deriva- 
tives and partial derivatives can also be written but are not 
needed for the present discussion. 

The next section of this chapter develops the technical 
aspects of linear systems of ordinary differential equations 
with constant coefficients. As illustrated in the present 
section there are numerous ways in which the restrictive as- 
sumptions associated with such systems can be relaxed. 

Technical Aspects of Ordinary. Linear 
Differential Eg-uat ion Systems with 

Constant Coef ficients " ' ' 

- — ~~ ■ / 

It is impossible to observe all the terms in a diffekren- 
f tial equation, since a differential equation always contains _ . ■ 

at least one derivati^ve, the derivative being a slope at/a ,. 
single point along a smooth curve. Take equation' (5a) as an 
example. The dependent varikble is the instantaneous rate of 
change in population size -- 

^1 ".^0 

dy/dt = 

lim ^1 0 

■ V^O 

where t^ and t^ are the two points in time that are very cxose 

together and y^ and y^ are the population sizes ^t»^hose • time 

points. Of course, it might be possible in theo^ to approx- 
imate the derivative at several time points by drawing a se- 
quence of observation pairs with the time points of feach pair . 

spaced a finitC(^m: short distance apart. In practice, however, 

S5 ■ ' , 
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this strategy would be difficult or impossible to carry out for 
inost social or psychological topics. A much more practical^ 
strategy is to operate on qeuation (Sa) to produce (5b), which 
can be compared readily to observations. 

f 

Note that if one. differentiates (5b) with respect to tr- 
one finds ' ' . 

- " • / 

\ bt ■ ^ 

dy^/dt = bCy^e ^) = by^, since y^ =.yo^ - ^ 

This , exercise shows that if one differentiates (5a) one obtains 
equation (5b). Equation (5b) is, therefore, the antiderivatjve 
or in definite integral of equation (5a). Equation (5b) is 
termed, therefore, a solution of equation (5a). Referring to 
the example regard.ing acceleration given bv^quations (9) , the 
relationship between (9a) and (9b) is paratTel to the relation- 
ship between (5a) and (5b) ; equation (9b) is a solution of the 
second order differential equation, equation (9a) , meaning that, 
if (9b) is differentiated twice, one gets (9a) back: 

2 dt dt 



dt 



^ d ^c^ ^ £.(e-^^ - 1)] 
dt dt b ^2 



d r c c -bt 1 

dt ' b ■ b ^ ' 



^2 \ -bt 
d y =^co 

— 2 
dt 

As does equation (5b), equation (9b) givea predictions that 
are observable. In general, finding a solution to a differential 
-equation may be interpreted as the process of converting hypoth- 
eses about change at every instant in time into a form suitable 
for use with -empirical data. 

The primary objective of this section is to describe prac- 
tical methods for converting the dynamic model of career expec- 
tations given by equations (4) into a form suitable for use in 
empirical v?ork. The section is subdivided into two subsections. 
TTlc first subsi^ction develops the mathematical theory necessary 
for finding integrals of linear systems, and the second _ sub- 
section considers statistical issues related to estimating para- 
meters of the integral equation. 
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Mathematical Theory 

This subsectian contains a review of mathematical solutions 
to systems of ordinary linear differential equation^ with con- 
stant coefficients; it does not purport to develop a rigorous 
statement of the mathematical theory. The review unfolds grad- 
ually, developing first the basic_tiieDry f or l^dmogeneous sys- 
tems, then addifig constant intercepts, and, finally, including 
^-a set of disturbance variables. , 

Consider the following equation expressir^ the general 0 
model of a linear differential equation system. 

(11) d;^/dt ^ a(t) + B(t)Y. 

where 

y = a K X 1 vector of variables 

a(t) = a K X 1 vector of intercepts that are continuous 
~ functions of time 

D(t) = a K X K matrix with each entry a continuous function ^ 
. ~ of time 

d^/dt = a K X 1 vector of derivatives of y. with respect to 
time 

If a(t) = 0 for all t, then the system is termed a homogeneous 
system. If every entry of B(t) is a constant over time the 
system is a lindar system with constant coefficients . 

Attention now focuses on homogeneous linear systems with 
constant coefficients, because essential aspects of the results 
for nonhomogcneous systems can be derived from the theory for 
homoqeneous systems. The homogeneous system is written 

(12) y - By 

where y = d^/dt, and B is the K x K matrix of constant coef-_ 
ficicnts. The goal is to operate on (12) to produce a function 
giving each element of the vector Y as a function of time, sub- 
ject to the restriction that equation (12) holds. 

Consider first a one-element vector and matrix, so that 
(12) becomes, a scalar equation rather than a matrix equation. 

(13) dx/dt = bx 

whore x,b are scalars. The simplest way to- solve (13) is to 

divide both sides by x and multiply by dt, producing 
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dx/x = bdt 
integrating both sides yields 



= /bdt 

X 

(14) In X - bt -H q 

bt^q bt, q, 

e is the mathematical constant which is the base of the natural 
logarithm, q is an arbitrary constant ^ and In is the standard 
notation for the natural logarithm used throughout this volume . 
Notice that if the constant q were known, equation* (14 ) achieves 
the goal for scalars — x is expressed as a function of time* 
Note also that the derivative of (14) satisfies the differen- 
tial e_guation (13) , for ^ 

dx/dt be^^{e^) = bx , since from (14) , x = e^^(e^) 

This simple method for finding a solution to (13) does not 
generalize conveniently to linear systems , jpince one cannot 
.divide by a vector , y . It is useful , therefore , to develop an 
alternative derivation of the solution (14) . To do so r sub- 
tract bx from both sides of the scalar differential equation 
(13) ; this yields : 



dx/dt - bx = 0 
Multiply both sides by e 



-bt 



to get 



e^^^ (dx/dt - bx) ' = 0 

--bt -bt 
Notice that de k== e dx/dt 



-bt 



bxG by the multiplication 

rule for derivatives; hence, the above result can be rewritten 



^ -"bt 

-de X = 0 



Integrating both sides yields; 



-bt q 
e X ^ c - e 

X ^ e c 



where c is an arbitrary constant . 
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This result matcites (14) with C « e^'^and, as it turns out, this 
second method of arriving at (14) generalizes rdadily to sys- 
tems of linear, homogeneous dif feirential equations^ 

^' . * ■ ■ • ,, 

To find the constant 6, let x^, designate the value of x . 

at a point in time, t. Now, note that, from equation (14),, 
at the zero point on the time scale x may be 'found by 

• ■ ♦ 

e (e^) = e c « . . . 

x_ = c 

o . , - 

Hence, c can be set tO' the observed value of x at an initial 
panelr in a longitudinal studj^. The value of x can then be 
made a function of time" by the following formula. 

*' 

, , - , bt bt q ^ - ' . 

(J5)x. ~ec = ee* : ^ 

•* ' _ bt . • .. 

where t is any point in time. This result Us, of course, the 
equation for the exponential population-growth model noted 
earlier in this chapter. \^ 

Equation (15) could be used in conjunction with a two- 
panel longitudinal study with one (or more) observations on 
X at t„ = 0 and at = t. Define b* = e" a v rewrite (15) , 

as f9ll9ws. ' . ''"^ 
(l'5a) ^*^q' ^• 

»- 

Obviously from (15a) , b* = x^/x^. One can calculate b from b* 

as. follows-:-' b = (In b*)/t. Substituting f<^ b* , one can cal- 
culate b directly. 

(16) b - [ln(x^/x^) ]/t 

Hence, if there is no disturbance (error) termi an observation 
at two time points (t^, t^^) for a single case suffices to gen- 
erate exact predictions at every point in time — by substituting 
the value Of b calculated from. (16) into the • prediction equa- 
tion (15) . , 

. - ■ ' • 

The groundwork has now been developed for finding a solu- 
tion to the* linear homogeneous system, equation (12) . To ar- 
rive at a solution, retrace the steps used in thfe second deriva7 
tion of equation (14). Start with (12) and subtract By from' 
both sides. • 



Premultiply both sides by e , where e is a matrix exponential 
-Bt , • „ . ~ ' ' ' 

. t2 - (y - B^) o 

m 

It cai% be shown" that the derivative of the matrix product, 

"•Bt —Qt "Bt 

e — ^, is. equal to e — ^y - e.- B^. Consequently, the above ex- 
pression can be written as follows - • 

■ . dq = ^ . , . 

just as for the scalar case. Integrating both sidos yields 

i ' 

. ■ -Bt ' ■ . " * ^ ~ 

e _ ^ = c . 
(17) ^ . 

^Bt 

y = e~ c_ 

where c is a K x 1 vector of- arbitrary constants. To find the 
vector~of constants', as for the scalar case, let t = 0, 

- ^' ^t value jof the vector y at time t. 

Then fruni (17') 
B • 0 

- c = I'C, I_ K X K, 'identity matrix 

c - y 

Substituting ^ into (17) yields the desired prediction equa- 

tion: 

- (18) y.t ^ y^ . 

Again, as in the scalar case, the value of B can be fo\ind from 
longitudinal data. There are at least two possibilities. First, 
one might have af least K cases at two time points (K - the 
number of variables) . Secondly, one might have at least K + 1 
observations for a single case spaced at equal time intervals. 
For the first instance, let be a K x k matrix of observations 

on the K variables for K cases, at time t, and let Y be the 

analogous matrix at time zero. Assuming is invcrtable, one 

finds 
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^Bt _ Y Y -1 

(ln(Y.V~^) is a matrix logarithm, not an elementwise log)\ If 

there are more than'K cases, any siibset of K cases can be taken 
so long as those selected are linearly independent, so that 

y ^ exists. 

-o • * • • • 

For the second isntance in which K + 1 or. more observations • 
on a single case are available, Y may be taken as K coliimn 

ve^w. rs, each column being an observation on K variables at 
single time point, starting with t^ = 0; can be defined 

analogously, ljut with the first colvunn of being .a set of ob- 
servations at the second time point ^ that is 



Y 



The length of time between adjacent observations must all be 
the same. 

Recall the definitions of the matrix logarithm and matrix 
exponential given in the mathematical review, chapter 3, viz . 

(20) In M = V(ln A)y ^ , |m1 ?^ 0 

(21) e- = Z^-v""^ 
V re 

M =^ a diagonal izable K x K matrix, |m| = determinant of M 

A = a K X K diagonal matrix wi-th diagonal entries containing 
" the characteristic values of M 

V = a K X K matrix whose columns are the characteristic 
vectors of M associated with \ 

The functions In A and e- are defined on each diagonal element 
of Af. i e . , 
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In X, 



(Recall from chapter 3 that , these definitions do fit the g^era!^ 
definitipn of matrix logarithm and of matrioc exponential, ^hat 

IS, InCe-) 



InA . . 
e - = A) . 



If every element of a matrix is multiplied by "Ja scalar, as 
in equation (18) where B is multiplied by the scalar' t, then 
the clharacteristic values of the ';?esultirig matrix, Bt, are 
just the characteristic values of the matrix? B multiplied by 
the scalar t. Thus, if A is the diagonal matri>J of eigenvalues, 
of B, then At is the diagonal matrix of eigenvalues of the 
matrix B t . Hence, equation (21) can be wrdtten as follows, for 
the matrix Bt. » 



(22) 



Bt „ 
e- = Ve 



-1 . 



where V, A are now taken as eigenvector and eigenvalue matrices, 
respect ively , , of B. 

The matrix derivative Tie- = e- B was used to derive the 

dt — 

solution to the homogeneoue linear system, equation (12), the 
solution being given by equat;ion (17). Equation (22) can be \ 
used conveniently to derive thfe formula for the derivative of 
t^he matrix exponential , e_ . 



d Bt 

— "O— 

dt 



At -1 
[Ve- V ] 



dt 



», /cL^At. -1 . 
~ dt — 



Note that if no diagonal entry in A is zero, the derivative of 

At 

the diagonal matrix, e- , is 
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d.Xt 



dO/dt 
dO/dt 



dO/dt 
de^^Vdt 



dO/dt 



. . dO/dt 
dO/dt 

.. de'^Vdt^ 




0 I 



0 



0 ... e^KH 




Consequently , 



2t ^ - — 



= VAe 



= VAV ■'■Ve-^v'"'^ 



-1 



-1 



At. -1.. 



= e- B 



_ Bt 

= Be- , 



Since B = VAV 



-1 



Having found the solution to a homogeneous system of linear 
differential equations with constant coefficients, one ig pre- 
pared to deal with nonhomogcneous systems with constant coef- 
ficients. The Hiathematical theory is presented first, then it 
is applied to the model of developing career aspirations *given 
i^ equation (4) . , " ' • ^ 

The nonhomogeneous system with constant coefficients is 
written follows. \ 

(23) y_ = a.(t) + By • 

w/iere a(t) may be a variable function of. time, but the matrix 
B is fixed over' time. To find the solution to (23) / trace 
through the same steps used to solve the homogeneous system. 
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i - = a{t) 
e"-^(i - By ) = e"^^a(t) 

d(e""-^il) = e--^a(t)dt 

-Bt \ -Bt , ^ . ^ 

e - = /e - a(T)dT + c , 

. o 

/ ^ • i 

(24) ^. = e-^c + G-^ !Q~^\\i)6r^r^ 

o 

v/here t is the dununy variable of integration as explained or> 
|>age 31. Setting = 0 and solving for the constant- vector, 

c, leads to the conclusion that c = Inserting c = 

(24) yields 



Bt ' Bt r-BT , . , 

(2 5) = s~ ^~ ^ 'L^'^i^'^ 



Notice thdt the term e- /e - a(T)dT is a K x 1 vector. 

o 

t 

.Denpte it by a* - e-^ /e'-^id ) di , artd l^t B* = e-^ * 

o 



Eqviation (2 5) can now be written 
(25b) y^ = a* + B*y^ 



For two fixed points in time, t^ = 0 , and t^ = t, equation (25bj 

is linear across observations. Thus, if k + 1 or more observa- 
tions at t =0 and ti = t are available one may form the 

matrices 
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a 



^21t ''■ ' ' ^2Kt 



'Kit* . . . ^KKt 



, and 



V 



P = [a*, B*l 



,Then for two time points, ~ ^' ^1 ~ ^' * 

(4) = 

where ' |Y^) stands for the determinant of Y^. 

As with the homogeneous case, if more than the minimum number of 
cases are available at t and at t, , then K + 1 lijiearly inde- 

pendent cases can be selected arbitrarily. Also, if K + 2 time 
points on a sing^ case are available, can be defined by the 

first K + 1 observations and by the second through the K + 2 

cases; then equation (26) still holds. 

Once P = [a*f B*} is found, a* and B* can be used separately 

to find the parameters of the nonhomogeneous differential equa-. 

Dt " 
ti'on (23) . Since B* = e— , one has 



(27a) B - (In B*) /t , { B* ] 7^ 0 
where In B* is the matrix natural logarithm. 

Note that this is the same solution as for th^ homogeneous case. 

h 

Appropriate use of a* \^epends on the nature of the function 
.^it) . hot . 

^ -BT 

^ ^( t , a) = '"e - a (t ) di 
o 

where a is a set of parameters of the f urtction a (t ) . One has 



65 



a* = e-^g^(t, a) 

(27b) „^ 

e = a^t, a) 

Under fairly general circuinstances , equation <27b) can be solved 
for the unknown parameters of the differential equation. To 
illustrate ; two cases may be useful. First, let the function 
a(t) be constant ov^r t, say a(t) = a a set of constant 

coefficients. Then equation T27b) becomes 

2 = l^e-^ - I^''^^:*' - II 5^ 0 111 ^ ° 

For a second example, consider a more ' involved instance. 
Suppose that each element of a{t) is the cosine of t + a^, 

and denote the entire vector by a(t) = c{a4-t). In this case, 
,the vector a(t) oscillates over time, but the different elements 
of a(t) do not oscillate in phaso unless all are the same. 

2 -1 r -Bt 

The function g^{t,a) = /e - c(aH-t)dt = (I + B ) {e - [s(a+t) - 

Bc(a+t)l - Is (a) - Be (ct) ]} , where the elements of s(a+t) are 
sinTa^+t) . This result can be substituted on the right side 

of (27b) , as follows: 

e"-^a* = (I + B^)"-^{e"-^[s(a + t) - Bc(a+t)] 
- s { a ) + Be { a ) } 
(T -I- R'')a-*= s(a+t) - Bc(a+t) - e- [s(a) - Be (a)] 

since (1 + B ) commutes with e- . Because the sine and cosine 
functions are periodic both with period 2-it , if t is judisciously 
selected so that t = 2nTT , n a positive integer, then tnis formula 
reduces to 

(I_ + B^)a* - (I -.e"^)[s(a) - Be (a)] 

In '"either form, it is unlikely that a simple algebraic solution 
can be found for a. Nevertheless ,^ for any application where B 
and a* can be calculated and t ig known, it is likely that a 
numerical sol\ition can fc^fe found. Use of numerical analysis to 
find solutions to equations that are intractable or insoluable 
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algebraically is an option that is not appreciated by many social 
researchers. One of the purposes of this example is to empha^size 
that algebraic solutions are not necessary to the conduct of em- 
pirical research. t 

The mathematical theory required for the model of career 
expectations given by equation system (4) has been summarized 
in the preceding- discussion . It now remains to translate the 
general results into a specific form suitable for use with the 
substantive theory of career expectations. To achieve this 
translation it is desirable to rewrite equation system 14) in 
matrix notation, as follows. 

. (28) ^ §1 ^^^^ ' - V 

where 

'dy, /dV 



/dy^/dt\ 
dy2/dt ^ 



/dt / 



a K X 1 vector ofi^er ivatives of y. 
with respect to time 



A ^- 



/^lO ^ii-* 



/ 



^20 ^21" 



2L 



K X (L + 1)' matrix of intercepts 

{a^Q) and effect coefficients for 

exogeneous variables {a..f j>0) . 

^ J 

These -coefficients are constant 
over time 



X 




an (L +.1) X 1 vector, with the first 
element being the constant 1 . 0 and other 
elements being the L exogeneous variables 
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B - 



^21 ^22" •^2K 

\ : : 



a K X K iiiatrix of effect coef- 
ficients indicating the impact 
of the endogenous -variables on 
each other. These coefficients 
are constant over time . 



y = 



^2 

* 



= a K X 1 vector containing endo- 
genous variables 



/ 



u^(t) 



u(t) = 



U2 (t) 



\;^(t 



= a K X 1 vector of disturbance func 
tions of time 



) 



Note that Ax u(t) is a K x 1 vector which is a function of 
time; hence, one may define a(t) = Ax + u(t) and apply the 
mathematical theory of the preceding discussion. The result 
of this application is an equation in which time t values of y 
are shown as a linear combination of time-zero values of y and 
of the predetermined variables, plus "disturbance" or error 
variables (u*) , as follows: 

{2 9) = A*x + B*^ + u* 

where, by use of equation (27a) 



{30a) B* = e-^ , and 

{30b) B = (In B*)/t , |b*1 ^ 0 



To find A* and u* , one may apply equation (27b). For this 
application, the definite integral on the right of {27b) can 
be separated into two additive parts: 
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The first terra on the right of this equation has an explicit 
algebraic solution which depends on the rank of the matrix B. 
If B is full rank (|b| 5^ 0) , the following solution holds i ' 



/e — ^Axdi 



= (I - e~-^)B"-'-Ax, 



B 



i 0 



Inserting this result in (27b) and solving for A* yields 

Ola) a* = (e-^ - I) B~-^Ax + e^^/e"^'^u(T]ldT , |b| ?^ 0 

o 



If the coefficient matrix B is not full rank {|Bi = 0), 
some additional notation is needed to define the solution. Let 
A be the K x k diagonal matrix with diagonal elements equal to 
the eigenvalues of B. If |b| = 0 , at least one diagonal 
element of A is zero. Let V be the matrix whose columns are 
the righf elgenveators of B associated with A, so that, if B 

is diagonal izable , B = VAv ""^. Assume there are nonzero eigen- 
values (Xs) in A and K2 zero Xs, + K2 = K, and let A^ be the 



K 



K 



diagonal matrix with nonzero X^ , i = 1 



K, 



1 

diagonal. Define A^ to be the K2 ^ null matrix containing 
the zero Xs. Also, let be the right eigenvectors as- 
sociated with A , and V the' K- right eigenvectors associated 

~ —1 *~ M \ 

with A 2 . Similarly, let V be partitioned into , a 

K, X K matrix of transposed left eigenvectors of B associated 

(2) 



in the 



with A^, and ' , a ^ K matrix of transposed left eigen- 



vectors of B associated with the zero Xs; 
thus, partitioned as follows. 



A2, A, V and V 



-1 



are , 



J 



A^' = 



Ai 



0 



/ 



Ai 



0 
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0 



0 



7; 



V 



-1 



I.J. 



(1) 



\(VK) 



^1 ^\r- 

0 0 / I V 



(1) 

(2) 



(1) 



and 



Q- = ve- V = (Vj^, V^)/ e-1 

0 



0 \ / V 



(1) 



I / \v 



(2) 



-1 



Using these partitions of A, V and V , the desired def- 
inite integral for the case when B is less than full rank and 
diagonalizable can be written 

/e"~^AxdT = [ (I - e"-^)V^A"^V^^^ + t V^"^^^^ 

Substituting this result ijitcji-the r:^ght side of (27b) and solv 
ing for a* yields 



Bt 



r^tw.A -^V^^> ^ t e^Vv^^^ 



+ e-^/e*-^u(T)dT 
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which, after simplification, can be written 

. (31b) a* - Kg^*" - Vlilii'^^^^ + t-V^V^^^Ax 

t 

Bt_ (2) (2). 
pince it can, be shown that e—W2V = 

The unsolved integral in (31a) and (31b) has no solution 
unless the function u(t) is defined. Fortunately, the function 
can be left unspecified and treated as part of the disturbance 
term in the statistical analysis: 



* Bt ^ -Bt , . , 

u = e- /e - u{t)^t 

o 

where u* is the notation for the disturbance variable (see 
equation [ 29 ] ) . 

Bt -1 

In equation (31a) , the term (e- - I ) B A is a K x (L + 1) 
matrix . Its elements can be considered coefficients of the 
exogenous variables . In equation (31b) , the matrix 

[(e^^ - I)V^AV^^^ + t-V^V^^^-l^. is also a K x (L + 1) matrix 

of coefficients of the exogenous variables. In both cases, the 

coefficient may be denoted by A and used m the integral equa- 
tion (29) . Thus, one has 

(31c) A* - (e^^ - DB^^A , if ! B ] ^ 0. 



r 



Bt ^,-1 



(31d) A - B(e-^ - I) A* 
(31c) A* = [(c-^ - nv^r^V^^^ t-V^V^^^iA, if |Bi^ - 0 

(31f) A = [(e^^ - nv^Ai'^V^^^ t.V2V^^h"V 

> 

It may be helpful to summarize these procedures. The steps 
in the analysis arc listed and described below. 

Step 1. Form a hypothesis about continuous change over 

time fay writing a' nonhomogeneovis system of linear 
differential equations with constant coefficients, 
written as follows: 

(32)^=Ax+By+u{t) 

where 
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* r 1 



V 



I ■■ 

^ = a K X X vector of derivatives of 'y.. with respect to 
tif«e . ^ •• 

x » an (L + l)x 1 vecto^.of exogenous variables wi-th the 
first element set to the constant value 1.0 

^ - a K X 1 vector of endogenous variables 

u(t) = a K X 1 vebtor function of time, considered to be a 
residual disturbance vector 

A = a K X (L + i) matrix of constant ^Coefficients as- 
sociated with the exogenous variables, the first 
x.^^ column representing intercepts 



( 



B - a K x K matrix of constant coefficients indexing^ ef- 
fects of endogenous variables on changes in endogenous 
variables 

Step 2. Integrate the differential equation (32) to form 
a prediction equation: 

(33) y^ = A*x + B*^ + u* 

where the subscript attached to ^ designa^s a specific point in 
time, and A* and B* are K x (l + 1) and K x K coefficient matricfes, 
respectively. . ^ 

Step 3. Estimate the coefficient martices A* and B* by 
• some type of regression methodology applied to 
longitudinal data. The longitudinal data must 
contain at least one observation per case for 
each exogenous variable x and two sets of obser- 
vations per case for eacH" endogenous variable — one 

set of observations at t and one at t, . Thus 

Q • 1 • 

a tWo-panel design is sufficient for empirical 
estimates of the coefficients . Equation (33) 
follows the form of simultaneous structural 
equations as given in the econometric literature. 
If one assumes x and are uncorrelated with u* 

then "ordinary least squares" regression can be 
applied. Otherwise, s£)ine different regression 
method , should foe used. Statistical methods will 
be discussed in the next subsection of this 
chapter. 

Step 4. Calculate estimates of the coefficients of the 
differential equation system (32) by using the 
following formulaei 
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(34) ..^B (lnKB*)/t 
(35a) A « B(B - 1)"-^^*, if t^l 0 ' 

{35b) A = ((e?^ - VY.iLi'^^^^ + t.V2V^^^^j"V, if [Bj = 0 

where In B* is a matrix logarithm defined by f^quation (20) . 

OncG the matrices A and B have been estimated, they can be 
applied to generate pre3"ictions to any point along a continuous , 
time scale. These predictions can be compared to observations 
as a test of the theoretical model. ' 



Statistical Estimation . • 

This Subsection briefly summarizes statistical theory re- 
lated to estimating coefficients of the integral equation (33) 
by regressiion methpds.. Unlike the mathematics of differential 
equations, the relevant statistical theory is familiar to many 
social scientists via the econometric and, increasingly, the 
sociological and political science literatures (see, e.g./ 
Johnston, 11963; Goldberger, 1964V Duncan, .1965; Heise, 1975; 
Goldberger and Duncan, 1973? Ostrom, 1978; Asher/ 1976; Board- 
man and Murnane, 1979). In consequence, the treatment of 
statistical theory -* this* volume need not be as detailed as 
the tr^eatment of _nematical theory^ In particular, this sub- 
section develops the assumptions underlying application of or- 
dinary least squares (OLS) to" two panels of data in order to 
secure estimates -of the- coefficient matV ices A* and B* iur equa- 
tion (33). Readers interested in di,ff erential equation systems 
for cjeneral applications should be aware' that numerous alterna- 
tive estimation techniques are available; these will be sum- 
marized briefly toward the end ofMiis subsection. The present 
focus on OLS applied to two panels of data reflects likely 
needs in status attainment -research, for ■^e -immediate future. 
First time .series of 'data with nximerous tim? points ^seldom are 
available in career-decision making research. Secondly, ap- 
plication of dif ferentival equation models in career-decision 
making research is new^ ■ ,^ variety, of statistical methods may be 
tried out as experience dictates. 

Equation (33) may be interpreted as a special case of^ a 
simul.taneou's equation system, and ohe may consult the econo- 
metric literature for investigations 'of appropriate statistical 
analysis. Befoj?e ad4rea^4^g the specific task of statistical 
estimation of equaWon'^.i^^i^; it is useful to review the broad 
outlin^^s of econori^tric th&ory of simultaneous linear systems. 
In this theory,, sets of. liitear equations are the* object of 
study; each equation ex^^f esses a hypothesis about the substance 
of one's topic. Consider the., following pair of structural equa- 
tions as an "example. '■ '■••t " . 
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over samples. The variance of the statistic, as with all var- 
iances, i '> just tne square of the standard deviation (standard 
-=ii^^or) . ."dertaihly, the smaller the variance of a statistic 
the better, "ceteris paribus. If one sample statistic has a 
smaller variance than a second statistic estimating the same 
* population parameter, the former .is termed more gf fic^g^tt than 
the latter.' As. an example, part of the central limit theorem 
states, that. the expected value of the sample mean is the popula- 
tion mean, and that the variance of the sample mean of x is the 
ixjpulation variance of x, divided by the sample size. Similarly, 
the mean over samples of a sample regression coefficient (m the 
fixed-effects model) is the corresponding population regression 
coefficient, and the variance of the sample regression coef- 
ficient is a function of the standard error of estimate and a 
diagonal clement of the inverse of the matrix of cross products 
among the independent variables. 

'Two concepts of major importance in structural-equation 
analysis are' bias- and consistency . ; Let s by any sample statistic, 
such as the mean or sample regression coefficient, and let the 
corresponding population parameter be denoted by S.- The expected 
value of s is denoted by E(s). The bias of s is the difference 
between its expected value and the value of the corresponding 
population parameter: 

Bias (s) = E(s) - 3 
>. ' . 

If E(s) = S, s is teimed unbiased . If E(s) 5^ S but Es 
approaches S as the sample size goes to infinity, s is^ described 
as asymptot ica ily unbiased . If the variance of a statistic goes 
to zero as the sample si2e goes to infinity, then the statistic 
is termed consistent . A .onsisteot statistic is asymptotically 
unbiased, but asymptotic unbiasedness obvious^ly does not assur^ 
consistency. Unbiasedness is a desirable characteristic of a,' 
^ • - • " it is necessary to accept a sample 



statistic, but frequently, ...^ ..-wv---v*^j 

statistic that is consistent; this is the case, for example, with 
many econometric methods such as two-stage least squares, 
li|Tiited-inf ommtion, maximum-likelihood estimation, arvi si 



so forth. 



:>i 



ERIC 



: 36a) = p^^^^ + p^^z + g^^^^ * ^1 

(36b) = P20 ^ P2l'= * ^21^1 * ^2 



where , y^/ 2 are observed variables, p. . and q. . are con-- 

stants, and v^^ v^ are unobserved disturbance variables. In 

{36a) y-^ is hypothesized to be affected by 2 and y2 / and in 

(36b) is considered an effect of 2 and of y^. Thus, y^ 

and y^ exhibit f^^dback effects on each other , but z is not 

affected by any variable in the system. Variables such as z 
are termed exogenous variables ; variables such as y^ and y2 

are endogenous . More generally, endogenous variables are vari-- 
ables-nvhose values are determined, in part by' other variables 
included in the system; whereas, exogenous variables are those 
whose values are determined outside the system, i.e.) th6 
theory does not account for their values. 
# 

For statistical applications , the most important distinction 
between exogenous and endogenous variables is that the exogenous 
variables are assumed to be uncorrelated with tke disturbance 
variables; whereas, the dndogienous variables are permitted to 
be correlated with the disturbance variables. It is argued that 
if y^ aff'^cts y^f then the disturbance for y-^ is likely to be 

corr£»latcd with y2 , ^nd vice versa. This is not a deductive 

argument, however; rather it is based on common sense. This 
point- does not seem to be understood "clearly in the literature . 
Often, readers might acquire the impression from published 
accounts that there is a deductive argument demonstrating con- 
clusively that if reciprocal causation occurs between y-j^ and y2/ 

, then a correlation necessarily must arise between y^ and the 
disturbance associated with y^ , and analogously, that y2 neces- 
sarily mi s w be correlated with the disturbance on y-j^. Neither 

of H:hese correlations follow deductively from the assumption 
of reciprocal causation . What cjoes follow deductively is that, 
once nonzero correlations are assumed between independent 
(inn.:>sured) variables and disturbance variables. OLS yields in-^ 
c J-;?tent estimators of the effect parameters in the structural 
equations. 

Equations (36a) and (36b) er»ch appear as a linear equatibn 
-with a stochastic error term (disturb'-"^ nee variable) ; hence, one 
might contemplate using ordinary least squares regr^iSsion 
to estimate the parameters — ^i j ' ^ij* The formula for 0^,S 

, estimation of equation. (36a) is 
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(37) [p^Q, Pii' ^12^ 



'^J^l^' ^^2^ij^ 



N 



2j 



j 3 2] 



-1 



where p ^ p , and q ^ are sample estimates of the correspond- 
iny population values P2_o' ^11' ^12' ^pspect ively , the j 

subscript stajids for the jth case, and N = the number of cases. 

To simplify the presentation, it is convenient to shift 
to a__matrix nota ^ion . 

Accordingly, let 

^ an 1 ^ N vector of observations on 

W ^- a 3 ^' N. matrix with the first row containing N con- 
stant values of 1,0, the second row containing N ob- 
servations on Xf and the third row containing N Ob- 
servations on y ^ . Each column of W represents a 

single case 

v an 1 >^ I* vector of values for v^ 

V 

^ r ^ a 3 ^ 1 vector of coefficients; 

«. 

- " ^^10' ^'ll ' "^12^ ' 



where the priine stands for transpose - 
Equation (37) can now be written ■ 



(37a) r^' = yW' (WW' ) 



-1 



where r is the OLS estimate of r. Note that (37a) is the trans- 
pose of most notations for simultaneous linear systems; the 
transposed notation is used to maintain consistency with the 
notation for the differential equation system'. The differential 
equations are Resented in notation that is prevalent in the 
mathematical literature. 
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To evaluate this estimate, write (36a) in tho matrix nota- 
tion for N observations — 

(38) il = r • W ,+ V _ , ^ ' . 

and postmultiply by the transpose of the independent variable 
matrix (W ' ) -- • 

{39a) yW' - £,'WW' + vW' 

■(39b) r' = ^'(WW')""^ - Y^'^M'^ 

assuming WW is nonsingular. For a given sample , subtracting ' 
(39b) fronT"! 37a) gives the sampling error: 

' -1 

('' 0) r ■ - r' ^ y}l ' (WW* ) 

-1 

Thus, r is unbiased only if EvWMWW') is zero. In general 
this expected value is not zero . Even worse , it does not tend 
to zero as the sample size increases. ItL probability limit 



IS 



plimi vV7' (WW' > plimf VW' ] "Ipliir (WW' )"'^] = 







> 




whore plim stands for probability limit, and 
'Wr^-tten out this becomes 


wis 


a column 


from 


-1 

E(v^wM [I^(ww;) ] ^ f^^i' Ev^2, Ev^y2] 


.1 


' Ez 






Ez 




Ezy2 




Ey2 


Ey2^ 


ey2 



5ince, by assumption Ev, , Ev z are zej;o, the difficulty arises 
from the nonzero expected value, £v^y2. As noted, this non- 
zero assumption is made because y^ is hypothesized to be af- 
fected by y^r and it therefore seems unreasonable to include 
in the assumptions t^e postulate that the disturbance for y^ 
is uncorrelated^ith y^. (Note that Ev^y2 is the numerator 
to the corrc^lation bet-'?een and y2/ since Ev^ - 0.) 
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This argument was developed for. equation (3^a) . An analo- 
gous argument ^pplie^s to C36b). Clearly, the sajae argument is 
quite general, applying to every equation in systems of any ,di- 
nujntions. Thua , it is concluded that OLS estimates are incon- 
•feistent (i.e., do not converge on jjthe population parameters . 
they purport 'to estimate as N goes to infinity) , when -one or 
i.iore of the regressors is correlaced with the disturbance var- 
iable in the associated structural equation. 

Consistent estimation of the structural parameters, p^^. 

and q . is approached by way of a re luced form set of coef- 

ficients. -To explore methods based on the reduced iform, it is 
instructive' to "rewrite equations <36) in the following way. 

0 - ^ho ^ Pll" - ^1 ^ ^12^2 ^ ^1 

.0 - P20 ^ ^'21^ ^ 'hl^^'l ~ ^2 ' ^2. 

T'-'ii pair of equations can be no ■ ctly written' in matrix no- 
tauion for L exocjcnous variables, K endogenous variables, an<a 
N obi^orvations: 

{4 1) q pz + Qy' + V , 

' N 

wlu^ro 

f. 

a matrix whose first ^^rC^' contains all ones 
and the remaining rows contain N observa- - 
tions on each of the L- exogenous variables 



a matrix who^e rows contain N observations 
on each of K endogenous variables * 

a -matrix of K disturbance -variables for !*• 

cases w , 

a matrix of constant coefficients including 
the intercepts for each equ-ation in the sys- 
tem and the parameters indexing the effects 
of each of the L exogenous variables on each 
of the \ endogenous variables 

a nonsingular matrix of coefficients indexing 
effects of tf>e endogenous variables on «ach 
other. The diagonal elements of Q can be de 
fined to equal -l.C to main+.ain consistency 
between (41) and (36) . 



y 



V 
K'-N 



K - (L+1) 



0 
K'<I 



O =. a conformable null matrix. 



Again, note that, to -maintain consistency with the previous sub- 
section, equation (41) is the transpose of notation most preva- 
lent in - the econometric literature . 

The reduced form of the system is obtained by premultiply- 
-1 

ing (41) by Q (assuitfed to exist) , and solving for Y. 

0 - (Q~'^p) z + y + □""■^v 



(4 2) Y = nz + V 



where 



-1 ... , . 

,{4 3) n ^ -Q^ P = the reduced-form coefficients matrix 

* • 

(44) V* - -Q^'^V 

Since 7. is uncorrelated asymptotically with V, (1/N)plim V*Z' = 

-1 ■ ~ * 

-1/N plim Q yz'.= (-1/N)Q plim VZ' = 0. Hence OLS can be used 
to estimate consistently if in equation (42) . The estimate is 

-1 ~ . 

II ^ Y2* iZZ') ^ 

' s 

Estimation of the structural parameters P and Q may now 
proceed by makinq use of (43). Premul tiplying by -Q yields 

(4 5) -Q!: - P . 

Giv>*Ti a sample estimate of the reduced form matrix f one may 
write thc^ sample version of (45) , and investigate how to use 

the relation for finding P- and Q; 

. (4[?a) -QIl = P • - ■ ' • 

In general, (4 5a) is not identified. There are K(L + 1) known'" 

values in -11, but K(K - 1) unknowns (excluding diagonal elements, 

set to -1) in Q plus K (L + 1)" unknowns in P, Obviously, one 
must draw on substantive theory to cohstrain the values of P 
and/or Q. Generally, certain elements of P and Q are assumed 
zero, and the altered relation implied by, T4 5a) and the z^ro co- 
efficients is ^amined to see" if the remaining unknowns can be 
calculated. Numerous methods are available for such calculations 
(see Coldberger, 1964 and Johnston, 1963 for' thorough reviews). 
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The immediate goal is to find suitable estimates of the 
matrices A* and B* in the integral < equation (33)* This estima- 
tion probrom may*^be interpreted one of estimating the reduced 
form of a general simultaneous structural equation system. To 
see this, it is necessary to introduce some additional terminology, 
The variables of a simultaneous system frequently are classified 
into one group termed p redetermine d variables and a second group 
called joi ntly dependent variables. Predetermined variables in- 
clude exogenous variables and lagged values of the endogenous 
variables, and the jointly dependent variables include only cur- 
rent values of the endogenous variables.. For instance, in the 
integral equat ion- system (33), x represents exOg lous variables 
and the lagged endogenous variables are denoted >y the pre- 

determined variables include x and y^. The jointly dependent 
variables are denoted by y^ . In scatistical applications^ the 

matrix Z contains predetermined variables, and Y contains joint- 
ly der>cnde.'nt variables. 

To pursue the argument, shift equation (3 3) into the no- 
tation -of structural equations developed above. Let ^ ■ repre- 

sent a K X 1 vector of values on K endogenous variables at time 
t for case j, and let be a (1 + L) >^ 1 vector with its first 

element equal to 1.0 and rem.aining elements containing values ^ 

of L exogenous variables for a single case. Also, let v,, repre- 

j 

sent a column vector of K disturbance variables for a single 
case. Now, define 



Z - /X. . . . 



V 



01 s -ON 



K X ii 



p - 

Kx (ULiK) 



Q . -I_ - The ncyaLive of a K x K identity matrix 



7,9 



How the ii^tegral equation (33) can be written 

{4^ Y = PZ + V 



(46a) O - PZ + QY + V 

Equation {46a) is precisely the same form^as the general struc- 
tural system given in (41) , with Q -I. Estimation via reduced 
form may proceed directly by reference to equation (45) . Clear- 
ly, when Q'- -I, (45) yields: 

^ (47) p - n 

Since P contains the structural coefficients for the predeter- 
mined variables and 11 is the matrix, of reiduced-form coefficients, 
(47) establishes the desired result: Est'.mation of the para- 
meters of the integral equation (33) may be viewed as estimation 
of the reduced form of a general simultaneous structural-equation 
model. (See Goldberqer, 1964: 373ff for a similar interpreta- 
tion.) Thus one may consistently estimate A* and B* by the 
following f' aulas. 

lA , B ) = P 

-1 

(48) P = YZ • (ZZ' ) 

The asymptotic variance-covariance'matrix for each row of P 
(equation of [33]) of these estimates is given by 

(4^) ; = l/N s^ ^ fE(zz')]"^ 



wPiere 



' ^ the covariance i?atr iK for the sample estimates 
^ ^ in the Tth fow of P 



- the variance of the disturbance for equation i 

V 



1 



) 



E(7.z') = the covariance matrix among the predetermined 
"~ variables. 

Generally, the quantities in (49) being population values, are 
unknown. However, consistent estimates can be formed by cal- 
culation of analogous sample values: 

% • 

\ 

(49a) C; ^ = (1/N)s^^ I (l/N) (ZZ') j"^ 

■"2 ' -1 
(ZZ') 



\ 



where the circumflex indicates a sample value for the correspond- 

2 ^2 
ing population statistic. Note that s^ - (1/N)E v^^ = 

s*" (1 - R ) , Where b is the sample variance of the ith joint- 
Yi 1 ' y. 

ly dependent variable, ai}pd R^ is the square of th^ multiple cor- 
relation for- equation i. A slightly improved estimate is given 
by replacing the reciprocal 1/N with 1/(N-K-1), where K is the 
number of rt?qressors. 

As noted in the beginning of thirs subsection, there are 
i.jmerous methods for estimating A* and B* other than applica- 
tion of OLS to two panel's of data. Consequently, it is impor- 
"tant to identify the key assumptions underlying this application, 
and to sur.\marize briefly alternative methods. The first two as- 
sumptions are substantive rather than statistical. They are: 

Assumption 1: Ail individuals are governed by the same 
differential-equation structure. This means that the 
matrices A and B (and, therefore. A* and B*) are the' same 
for all persons. 

Assumption 2:.- Tlic ^f f erential-equation structure is sta- 
tionary over time. Vhis means that all values in the co- 
efficient "matrices A and B are f ixed^ over time. 

It is possible, though improbable, that statistical assumptions, 
to be reviewed presently, could hold even though assumption 1 
and/or assumption 2- are not valid. In such a case, one would 
produce good statistical estimates of parameters that provide 
seriously incomplete description of the substantive topic. 

r 

The next two assumptions are necessary (and sufficient) for 

the statistical estimates to be consistent. 

•>^( 

Assumption 3; The means of dh&turbance variables 

are 2:cro . ' 

Assumption 4; The independent variables z are not corre- 
lated with the disturbance variable y v. This means that 
all exogenous variables x and lagged..endogenous variables 
^ aj;e not correlated with any unmeasbred variables v. 

If assumption 3 and assumption 4 hold, OLS estimates are con- 
sistent, meaning that; . (a) tJ»ey are unbiased in the limit as N 
approaches infinity (i.e., asymptot ic'ally unbiased), and (b) their 
^sampling variance approaches zero as N approaches infinity, 
(^he latter feature implies the former.) 
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The next assumption is re^guired for derivation of the var- 
iance covariance matrix aii^ong the regression coefficients of a 
^iven equation (equation [49]'). 

Assumption 5: The variance over samples of each dis-- 
turbance term is the samq for all observations and it 
does not depend on z. 

If assumption 5 is not mot it does not imply necessarily that OLS 
is inconsistent. Rather, it means that statistical tests of 
significance should not be applied using formula (49a) to es- 
timate standard errors. In this case, however, generalized lin- 
ear regression (GLR) may be preferable, to OLS, because, while 
both OLS and GLH are consistent, the sampling variability of 
GLR is less than for OLS (i^.e., GLR is more efficient , see 
Goldb ^rgcr / 1964) . 

AJiy one of these assumptions poses a serious threat to the 
valid use of OLS to estimate A* and B* , because status attainment 
theory is not sufficiently developed to generate confidence that 
any of the assumj^tions are met. It should be emphasized, how- 
ever, that no statistical method will compensate for incomplete 
theory. Thus, one should not expect to find the situation MUCh 
improved by resort to alternative statistical procedures. 

In the statistical literature on simultaneous structural 
equations, assumption 4 has been of major interest. As review. d 
above, when assumption 4 fails (i.e., regresses are correlated 
with the d istuVbance) , OLS is biased and inconsistent. If OLS 
'estimates are inconsistent, then numerous alternative estimation 
* methods miqht be substityted; examples of alternative methods 
include instrumental variable estimation (IVE) , indirect least 
squares (ILS) , t?Wo-stago least squares (2--SLS), full-information 
maximum 1 IVo.l ihood (FIML) estimation , and three-stage least 
squares (3-SLS) . All of these methods depend on measuring a 
set of exogenous variables (or instrumental, variables) that have 
rto direct effect on tMo jointly dependent variables but are un- 
, correlated With the di sturba-ice variables . 

The bias and inconsistency of OLS when it cannot be as-- 
sumed that regressors arc uncorrelated with the disturbance 
is well known, but a similar resuTt for alternatives to OLS has 
hot been widely publ iciz^ed . To emphasize the point that stat- 
is' il methods cannot substitute for matare theory, the case 
of instrumental variable estimation (IVE) is examined below. 
Not surprisingly, it is found that JVE estimates are biased and 
inconsirtont when it is assumed erroneously that direct e^^^fects 
of the instruments are zero. It should be emphasized that IVE 
is equivalent to indirect least squares , and for "just identified" 
systems, also equivalent to other methods snc:h as tv;o-Btage 
least squares. 
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Instrumental variables for a given structural equation are 
variables tuat are uncorreiated with the disturbances for that 
equation and are correlated with the independent variables in 
the equation. Suppose that .V7 is an (L + 1) x N matrix with the 
first row containing all ones and the remaining rows containing 
N observations on L instrumental variables. To avoid defining 
additional notation, assume that W is uncorreiated with all dis- 
turbance variables; in this case EVW = 0. Referring to equa- 
tion (41) and using the fact that 0 = one may write 

O - PZ - Y + V 

Y " rz + V 

-1 

The instrumental variable estimation of P is P = YWMZW') ; 
by tl.e assumption (ZW')"'^ exists. The sampling error for P is 

>1 -1 ' . 

P - P ■■■ YW'CZW') -P = (PS + y.)W'(ZW') -P 

- £ VW ' ) '"^ - P 

- vw' (zw ' ) 

TakiiKj probability limits, one finds 

pUm(P-r) = plim VWMZW')"-^ = plim VW' plimCZW')"^ 

-1 

-- Evw' (E7.w') 



^ 0 

where z, w are columns from Z and W, respectively. 

Hence , un icr the stated assumptions, i.nstrumental variable es- 
timation vand indirect least squares) is consistent, and, as 
shown earlier, OLS is not. This is a basic result in the econo 
metric literature. 

An implicit assumption of the IVE is that the instrumental 
variables exercise no direct effect on the' dependent variables. 
To sec this, suppose that some or all the instruments do exer- 
cise some effect on the dependent variables and let S bo a 
conformable matrix containing the effect parameters. The ob- 
s«rvations now are generated by the fol^ov^ing equation. 

y - pz + s w + V 

% 
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Suppose that one^ assumes mistakenly that S - 0 and forms IVE es-- 
tiftiatesr P ^ YW'(|W') . The sampling error ^i'or P now becrojnes 

(pz ^ sw ^ ^ L 

- s (WW' ) (zw' ) "•^ + {VW' ) (ZW' ) ""^ 
taking probability limits gives 

-1 

. ' " . A . . . 

Thus , it IS clear that instrumental variable estimation generates^ 

inconsistent estimation of P wheti it is assumed incorrectly that 

the instruments have ^ no direct effect on the dependent variable 

{i.e.^ incorrectly assume S = O) . 

This is hardly a surprising result ^ but it does emphasize 
the point that methodological techniques cannot substitute for 
adequate theor y . Whatever method is used , assumptions that can- 
not be tested against data must be maiie' . Confidence in the re- 
sult of the calculations must necessar ily rest heavily on confi- 
dence in the theory. 

The above discussion introduces the general question cjf 
identification . Before any statistical method can be applied/ 
theoretical assumptions must be imposed. A thorouah presenta- 
tion of the identification problem is beyond the scope of this 
monograph/ but a brief treatment is appropriate . 

# 

Assume that y represents one case on one of the dependent 
variables (a scalar). Also, lot ^ represent one row of P cor- 
resp*onding to y , and let v be the corresponding sc'aTar dTsturbance, 
Let 2 be one column f^'^om the matrix Z . With this -notation / the 
model can be wrirten: 

« , 

y - ££ + V 

postmul t iply ing by z' and taking expectations on both sides, one 
nvay derive estimating equations. 

Eyz' ^ E^^* ^ ^'^^^ 



or 



Ezy = (Kzz.')£' + Kzv 
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The second form expresses the system of estimating equations in 
conventional aoluran -vector for©. In the system, £ and Ezv are 
unknown, and Ety and Ezz' are assumed knovm. There are L + i 
rows, hence L + 1 equaOons. There are L + 1 unknown elements 
in p* aid L + 1 unknown elements in Ezv (including the mean ot v) 
Letting I represent an identity matrix of order L + 1, the sys- 
tem can be written as a single, system: 



E2y = IE : l) 



[Lv] 



Thuy, wo hnve a system of L + 1 linear equations m 2(L +1) un- 
knowns.. The maximum rank of the supermatrix of known coefficients 
\Ez7."t I] is, therefore, L + 1. The i-dentif ication problem re- 
soU^cs into the process of drawing on theory to place at le&st 
L 1 independent restrictions on the system. Assuming the ranK 
of the system is L + 1, when at least L + 1 additional linear 
restrictions are imposed, the system becomes identified. As it 
stands, it is under identified . As the previous discussion im- 
plies, the most common type of restriction i? to a&sume certain 
elements in P' and/or Ezv are zero.. A total of at xeast L + ,i . 
such assumptions are necessary (but -not sufficient). When it is 
assumed' that all elements in Ezv are zero, OLS are consistent 
estimators, but when some combination of elements from £ and 
Ezv are sot to zero, some different method must be used. If 
exactly L + 1 values are set to zero, then the system is 
identified, and, if some nonzero elements are in Bzy, methods 
sucirii~Tndirect least squares or two-stage least squares-atre 
identical to instrumental variable estimation. If more than 
L -t- 1 coofficients are set to zero, then the system is over- iden- 
tified. If some of the nonzero elements are in Ezv, then methods 
ili^lT^s two-stage least squares, three-stage least squares, or 
fuU-information maximum likelihood must be . used. The reader is 
referred to standard texts in econometrics (e.g., Johnston, i^^bj? 
• Goldberger, 1964) for exposition of these methods and a t^iorough 
treatment of the identification question (on the latter is^sta^, 
see also Fisher, 1976) . 

Although economic theory sometimes may be powerful enough . 
to specify more than the minimum L + 1 zero coefficients, status 
attainment theory does not justify even the minimum L + 1 as- 
sumptions with sufficient authority to generate strong confi- 
dence in any estimation technique (see the discussion of a paper 
by Haller and Woelfel ll&71a] ~- Land, 1971; Henry and Hummon, 
1971; and Woelfel and Haller, 1971b). Henc^, it -seems unlikely 
that there will be inkhe immediate future much ^ustif igation 
for apply' mf-thods in status attainment work -such as two-stage 
least squares that are designed for overidentif led systems. 

To il'^ustrate these comments, consider again the system 
represented by equations (36). Let z be parental status, y^ 



represent significant other's occupational expectation of ecjo, 
ai>d indicate ego'si own occupational expectation. Because 

there is causal*, feedback in the system, the standard presumption 
is that is correlated wi-th v^^ ai^d yg^xSrCprr elated with v^; 

hence, OLS are inconsistent^ and should not be u^ed. Although it 
does seem plausible that ^iy2 ^ 0' ^^2^1 ^ ^' state- 

ments do not follow deductively from equations *( 36) . On the 
other hand, theory ^Suggests , but certainly does not confirm, 
that the effects of parental background on ego's occupational ex- 
pectation operate indirec.tly via significant-other occupational 
expectation of ego. If one is willing to make this assumption 
and set p^i = ^' then consistent estimates P20' ^Z1' ^'^^ 
%.^2 secured through indirect least squares. 

This situation is- a specific example of the general oondlu- 
sion reviewed above. Without some restrictive assumptions, a 
unique solution for all unknowns in the eduation {36b) , 

^20' ^21' ^21'^^2^1' ^^-"^^ot be found. It is plausible that 
^21 ^ and, Ev2y^ 0*. If these assumptions are accepted, then 

indirect least squares are appropriate. It is almost as plausible 
however, that q2^ ?^ 0 , and EY2yj^ = 0, thus indicating use of OLS. 

Thf: most likely case isV^hat neither coefficient is zero. 

The example is typical^f 'the dif f icul ty- of identifying 
structural equation models in status 'attainment ^research (se^ 
Nolle, 1973, for example). All published work includes a larger 
set of variables than the example, but each independent variable 
added to dn^ equation adds two unknowns, but- only one new equation 
to the OFtimatinq-equation '^et. Hence, adding variables is not 
sufficient to identify the parameters of any of the striiptural 
equations. What is required, is' better theory, but good" theory 
;tak^s time to develop. In the 'meantime , exploratory analysis 
/based on available theory and data is necessary. As indicated . 
here, Appdication of OLS in such exploratory analysis frequently • 
may be as easy to justify as alternative methods such as in- 
direct least squares, presence of feedb^ick, loops in the system 
not withstanding. It does appear, however, that use of more thfn 
one estimation me' : od frequently could contribute some insight 
into the substantive questions under study and serve to emphasize 
the eKi>loratory nature of the analysis. . .'^ 

One of the main themea of this volume is that tests of theory 
by checking the' accuracy oj^ forecasts derived from theory con- 
sti^utes a promising tool for improving the quality of theory. 
First, forecast accuracy is- a much more stringent test of theory 
than most techniques in general us'e curi'ently. If theory fai,ls 
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to generate accurate^ forecasts, then scholars are led to reform- 
ulate %he jth^^y. li t^eqry consistently does -generate accurate 
forecasts / then confidence in it will be strengthened consider- 
ably over what can be justified by reliance on multiple correla- • 
tions. Secondly, empirical tests using forecasts should stimulate 
forrhulation. of dynamic theory to account for the behavior of all 
variablo^ at every instant on the continuous time scale. Such 
dynamic formulation represents a marked improvement over cross- 
sectional models that are so prevalent in the literature. 

Much confUsion is associated with selection of an approp- 
riate estimation method. For example, one, paper on statistical 
estimation with panel data statep : 

Whenever a lagged dependent variable is included 
in the model, the errors will be correlated with at 
\east one of the regressors, and ordinary least squares 
rcqression will be biased and inconsistent (Hannan and 
Your.g, 197 /) . 

As noted above, this statement is inerror . It does not follow 
doductxvely that lagged endogenous iShriables are correlated with 
anv disturbance variable, including the disturbance associa;ted 
with the current value of the same variable. Such a correlation 
might bo a lively hypothesis, but it does not automatically 
follow from the model. 

Two rea-.ons emerge from the above discussion suggesting 
that OLS estimation can be recommended as an initial methodology 
for estimation with two panels of data. First, the required 
assumptions have been made explicit and are at least as plausible 
as alternative assumptions. With OLS, one need not assume any 
of. the coefficients in A* or B* are zero^but must assume all co- 
variances between the disturbance^ variables y and the exogenous 
variables and lagged endogenpus variables are zero. Non OLS 
methods require that some elements of A* artd/or B* be' .s^'^ned 
z€;ro, but permit estimation of som^: cpvariances Involving qas- 
turbancc variables.. Secondly, OLS selects parameter estimates 
to minimize the variances in the disturbanpe variables. If 
the theoretical modea is approximately accurate, then the min- 
'imi^atiion criterion should generate good forecasts. If it does 
not, then alternat4ve estimation m^!\thods should be tried. 

• ^ . ■ ■ • 

Probably' the main threat to consistent estimation using OLS 
is the autocorrelation of disturbances, meaning that the un- 
'measured variables are correlated over time. Autocorrelation 
certainly would arise, for example / if phe or more important 
variables -i^ omitted from the analysis .and the omitted variables 
are correlated over time, ; veh, a.firSt prder autocorrelation 
process for the disturbai . valri^bles, the correlation matrix ■. 
between, lagged endogenous variables ^^^^^ and current disturbance 
v. , can be written, •■ 



wh^re r is' the matrix of . aatocorrelations, A iSv the ina'tri:^40£ 
cross-sectional variances and covjatiancfes ampftg th/e disturbances; 

■ K . ■ V ' \ ■ * ' • < • 

^« .Ev^v^', all t. It is assumed .that A < ,l^arvd that the pro- 
cess h^s operated* over n periods with "h sufficiently large so 

that r" 0. This covariance is unlik'ely to be zei#o,'but note 
that Tt approaches -zero as.teititer T. i±he i natr i^x o l f au L ocorrfr la~ 
tions) approaches 'zerO' or as A (the cross-sectional, variance- 
covariance matrix among re§iduaj.s) approaches zfero. Since the 
diagonal elements of A^are error Va^-iances, one sees that a 'model 
accoui^ti'n^ for ^ larg^ proportion of ^^^^riance helps to keep the 
asymptotic bias of OLS low,.- A sjjort measujrement interval probably 
contributes to small A but tabdsf to generate large F; hence, it- 
is difficult to draw conclusions I about the appropriate length of 
the measuremefit interval. ». 
- ' • 

- This discussion has. been confined to estimation methods 
based on two panral^ of .dat^^ because this' seems like the -most 
likely data base in ^tatus- attainmeJxt work in the iiimediate fu- 
ture. T&e reader slSaldrbe reminded,- howler, that estimation 
is possible with a tme series on a giv^n c^se extending at 
least to one more observation than number of variables in the » 
model. For presentation' of estimation methods with time series 
data, see Ostrom (1978) or Box and Jenkins (1^70) . Econometric 
texts show how to handle time series data. Doreiart and Hiammon 
*(1976) give es^ples of estimation with time series. In ad- 
dition, pool»ing of time series of cross sectionls sometimes pro- 
duces improved estimation (see Hannon- and Young [1977] for a 
review and survey of recent literature) . , ^ , 

In sum, it appears that use of OLS to estimate the coef- 
ficients c.the integral equation (33) is justified as an 

\ initial strategy, but that alt*ernative specification might fruit»- 
fully be investigated in future research. In particular , ^ it 
seems advisable to e^xplor^ hcJw €he dynamic quality of dif- 
ferential equation model can be exploited to a^ress the stat- . 
istical issues. (See Doreian and Hummori (1976) for several il- 
lustrations o^ estimation techniques tailored to specific sub- 
stantive questions.-)- Also, specification of a model Recounting 
for measurement error is desirable (see, e.g. ,' CoItJIah , 1968; tf 

^Joreskog, 1973? Wheaton, et al^. , 1977), but it- is "beyond th^ scope 
of the present volume. ' • .• 
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Computer Calculations 

' • ■ • 

•The nnSain purpose of this section is to describe a computer 
prograift 'package which is available for converting raw-:data in- 
put into estimates of the coefficients bf the differential 
equation (28) (the matrices A ^and B) * by 'usi^l OLS to estimate 
the parameters of the corresponding integr a liquation (29) (the 
matrices A* and B* , contained in the matrix P) . The pr^sehta- 
S tidn proceeds in four parts, ^irst, a. brief discussion of the 

calculating task is presented. Secondly, the input required to 
use the computer program is described.. Thirdly, the output of 
' the program is described. Finally, provision in the program 
package for carrying out data transf oirmations is described. ^ 
t ' " • • ♦ .' 

The Calculatirjg^ask 

• ■ 

fl / "The logic of the program is »quite simple. The calcu?.ations 

proceed in three steps: ^ 

I 

1. Calculate means, standard deviations , and correlations 

2. . Calculate OLS estimates of P = [A*, B*] , using the 

means, standard deviations, and correlations as input 

3. T Calculate estimates of the coefficient matrices A and 

B from A* and B* , using equations (30) and (31) 

Calculation of means and, standard deviations by the program is 
executed by omitting missing observations on each variable and 
dividing the accumulated sums by the number of- observations 
present for- each variable. Correlations are calculated using 
all data present for each pair of variables. 

T"he only unusua\ calculations are associated with finding 
the matrix logarithm demanded by equation (30b) . Tl\is calcula- 
tion can be accomplished by finding the characteristic roots 
and vectors of the matrix B* and applying formula (20). Since 
B* is not a symmetric matrix, its characterhstic roots and 
vectors may sometimes be complex numbers; hence, complex-number 
arithmetic is required. To find the eigenvalues end eigen- 
vectors, tfie program package uses subroutines from an eigenvalue- 
eigenvector package called EISPAC distributed by The National 
Software Center at the. Argonne National Laboratories in Chicago. 
^' ^ All programming is written in FORTRAN IV, Level G. 

■ I ^- 

The calculations ,are executed on the assumption that the 
• matrix B is full rank.' This is not a serious limitation, how- 
ever,' slnc^ it is rare to find a singular matrix in empirical, 
data. 
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• Input to the Pro-am 

To operate the program, three types of input are required: 
a) data containing N cases for all variables, b) control cards 
describing the data, and c) system-information cards. . r 

Data Input 

The program assumes that observations for all variables are 
grouped together for each case (e.g., individual respondent) , -and 
that the position of each variable on the cards ^ tape or disc is 
the same for all cases. To illustrate the proper arrangement of 
data, figure 7 shows the arrangements ~on cards for three vari- 
ables and five cases, e'ach case being a person. 







Col. No. : 


ir3^S67a90123^567 


card 


1, 


person 1 


75 


6.9 


.46 


ca~rd 


2, 


person 2 


2 


.3 


.82 


card, 
card 


3, 


person 3 


32 


3.8 


.19 


4, 


• 

person 4 


65 


7.4 


.56 


card 


5, 


person' 5 


42 


.9 


.94 



FRir 



Figure 7. Illustration of organization 
of Input \|)ata 

{Of course^ the decimal . points shown in t-he illustration need 
not be punched on the cards, as a FORMAT statement- can be used 
to ^position the decimal for each number.) If there are too 
many variables to fit on one card, ifontinu^tfion cards can- be 
used for each case. There may be a^y number up to and-' includ- 
ing 1000 variables in the data set. The |>rogram selects the 
variables^ needed for a given analysis in what^v'er order desig- 
nated by €he user, according to information ,supplied by the 
control cards.. The data may be stored according to a' FORTRAN 
FORMAT, or they may be unformatted, using FORTRAN unforma,tted 
input-output. The mode of data storage ^nd FORMAT statement, 
if needed, form part of the information supplied to the program 
by the^controO. cairds. 

^ 

Control Cai?d^s 

__ - ^ , ^ ^ ^ 

There are three control cards which supply information 
aboyt the data to the^ program, The three cards are a) RUN CARD, 
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b) DATA CARD, and c) FCtfeMAT CMRD» The content of ^hese three 
types is desgrilbeS* below. 

One RUN CARD is r.equired for each analysis, ^nd it must ap- 
pear first. The RUN pARD supplies .seven', parameters to, the pro- 
gram; the information, location on ;the RUN. CARD and dfef ault w^l'pe 



for each parameter ar'e described below. 



pai^am^te: 



Location 
Xuolumn . 
Numbers) 

1-5 
1 



parameter 2 (NIV) : 6 - 10 



parameter 3 (NR) : 11-15 



parameter 4 (ITyP):16 - 20 



parameter 5 (NU) : 21 - 25 



Y 




4 



Purpose 



Indicates the number 
variables ±n the analy- 
sis ■ - ' 



Indicates the number of 
exogenous, variabliea in 
the differential equa- 
tion system 

Indicates the nxanber of 
variables in the data . 
set from which data is 
to, be read, or the num- 
ber accounted '.for in ' 
the FORMAT CARD,' if 1. . 
the latter ds less than 
the former 

Indicates whethe^^^the " 
data ate .formated 6^ un- 
formated; 0 = -unformatr 
ed; nonzero indicates 
the nu^er of c'ardSy up 
to six, on. which the^ 
FORMAT *is punched ^ 

Indicates the unit num- 
ber associated with 'the 
READ statement used for 
datja input. Selection 
of the 'unit is described* 
in the text.un^er systtem- 
inj^ormation cards V;^ 



Default 
Value 

^ None 



None 



^1 
equals 

to 

para- 
meter 
1 
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Location 

(Column 

Numbers) 



parameter 6v (T) : 26-30 



..s 



Purpose 



Indicated' the niunber . 
of Wnths elapsed be- • 
tween panel 1 and 
panel 2 



Default 
Value 

12 • 



paraftieter 7 (MIS) : 31 - 40 



A - 



Indicates the nvmer- 
ical value used to 
indic^e missing dat^ 
(Jtf the missing data 
code is not the same 
for all variables, 
transformations must 
be executed *to change 
all missing data codes* 
Jto the same value. .^The 
last subsection, en- 
titled "Data Transforma- 
tions" illustrates how 
this can be done) . Zero 
is an invalid missing 
data code . 
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At iea^t .one DATA CARD is required for each analysis, and 
should follow the RUN CARD. The DATA CARD indicates the var- 
iable sequence number of each variable to be Used in the analy- 
sis. Each sequence number is right justified in a five column 
field — the ^rst sequence number appearing in columns 1-5, 
the second appearing in columns 6-10, etc.- (Any number of 
continuation cards may be used.) If the data are formatted 
(1 < ITYP < 6; ITYP parameter 4 on the RUN CARD) , then the 
sequence aumber gives the order in which 'the variable appears 
in the FORMAT statement supplied by the FORMAT CARD. If the 

tta are .unformatted (ITYP = 0) , then the variable sequence 
"mber indicat€^s the order in which the variable appears in the 
pUt data set. If for a formatted dat^set, a FORMAT state- 
ment is supplier describing all variables in the input data set, 
then the sequence number gives the order of the variable in t|ie ' 
data set for • formatted data, just as it does .for unformatted ' 
data. If the data are formatted, tt is recommended that a 
standard format describing all variables in the data set be sup- 
plied; this procedures avoids the need to cOlistruct a new FORMAT 
CARD for eaph analysis^ thereby reducing the numerous chances 
for ^rror inherent in^ons true ting FORMAT' statements . 

-in each analysis, the variables are ordere^ in tJie same 
order in which they appefar on* the DATA CARD. ' The program assumes 
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that exogenous variables are listed first, followed by the endo- 
genous variables ior paael'one, then by the endogenous variables 
for panel t wo. The endogenous, variables for the two p anels • 
should be listed in th& DATA' CARD in the same orderi_as^ the order 

:iables 



of the enaoqenous var: 
the FORMAT statement ar6, 



fo r panel 
used in 



one. 

tHe 



If all variables xn 
order in which they 



^..^ , to be 

occur in 't-he , FORMAT , the^ one may insert- a single blank DATA 
CARD. ' " . , 

One FORMAT CARD is required if the input data a^ formatted, 
but should be omitted if the input data are unformatted. If 
• included, the FORMAT CARD follows the DATA CrJlD. The FOR^^T 
statement is limited to six cards." All the rules for FORTRAN 
FORMAT statements apply, and, since th^e rules ar^ Widely avail- 
able in other sources {e.g.,-^he SPSS Manual [Nie. 'et al . , 197531,. 
and numer«ius texts on FdRTRAI*^ogramming) and fartiifiar to 
^readers/ tehey are not ^reviewed here. It should be noted, 
%\feK, .that th6 wotd FORMAT does not appear on ^^he FORMAT CA] 




To illustrate ^he use of control cards, assume the i: 
data set is formatt:^ and contains 100 variables. For th6^ cur- 
rent fun, suppose there are two exogenous variables and fiv>^ 
endogenous variablejs, each of the I'atter measured twice — on\se 
at t and once'at t, , giving twelve variabiles, in %he cii&rent run/. 

t o , 1 • * 1 £*• ' 

A^i-ome that all lOO^variables appear on cards in ad^aceiVt f iv^ 
polumn 'fields (thus T»equiring seven cards per case^ , andHhat 
the twelve variables for the current run are arranged as follovje 
or^^'the cards: , . * • , 



Symbol 



y 



o2 



^o3 



o4 



^o5 



'11 



J. 



>■ Name 

first exogenous vari- 
bie - 

second e:fe©genous 
variable 

first endogenous * 
va:^able, 't^ 

second endogenous 
variable, t^ 

third endpgenous 
variable, t^ 

fourth endogenous 
variable, t^ 

fifth endogenous 
variable, t^ 

first endogenous 



Card 
No. 


Coliimns 


Sequence 
Numbers 


1 


6 -10--, 


2 ^ 




21-25 


= / 


1 


51-55 




• 


41-45 


9 


2 . 


21-2 5 


21 


2 


61-65 


29 


3 


11-15 


35 


3 


26-30 


38 



* 



I- 



\ariable, t^^ 



V. 
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*' • , '.Card * Sequence 

Symbol Name No. dolmans • Niirobers r 



12 
13 



'^y^ second endogenous • 3 4j6-50 42 

variable, - 

= third endogenous 3 • 61-65 45 
variable, t, 

y; . '= fourth endogenous . 5 - 31-35 71 

variable, t^^ ^ ♦ 

y,- = ^ifth endogenous 3 31-35 39 



variajple, tj 



jnVthis 6ase the sequence' number follows. directly from the card 
and column numl^rs. Assuuning every five'-column field containe^ • 
one and only one of the 100 variables, the sequence number whs / 
calculated by: sequence # «' 16 x (card #, -1.0) + (last col. # " 
in field) /5./ These sequence numberi^ give the order in which' the 
variables appear in the iriptit data. Further, suppose jbhat any 
missing data is coded 10^, (1.E9 on the cards), that twelve mont;hs 
separate t^ from t^, and that -^t is. desired that the variables ' 

appear in the analysis in the order as liste4 in the above tab-x 
ulation. The control cards shown in figure 8 can be used to 
operate the program.- 

' 12 2 100 1 .(SUN CARD) 

2 5 11 9 21 29 35, 38 42 'AS- 71 39 (DATACARD) 
(16F5.0) . ^ (FORMAT CARD) 

Figure 8.' Control ca^'s with FORMAT for all variables ^ 

The RUN CARD indicates -12 variables 4n the analysis, 2 exogenous 
variables, 100 variables accounted fqr by the FORMAT CARD, and 
formatted data with one .FORMAT CARD. Default values for the 
other parameters on the RUN Ch^D are assumed by the^ program. 
The DNm CARD lists the sequence number of each variable. The 
sequence numbers appear in the order in which they will hk used 
for the analysis. The FORMAT card indicates the FORMAT of each 
of the five cards per case.* FORTRAN assumes the same FORMAT is 
to be used, for each 'Card. The FORMAT indicates five co'lumn 
fields with the decimal placed to the right oif all digits. 
Decimals punched on the card override the FORMAT. 

Alternatively, the cards in figure 9 accomplish the same 
purpose. In this case, the FORMAT CARDS account for only the 
twelve variables to be used in the analysis? hence the default 
value can be used for parameter 3 on the RUN CARD, setyj^ing 
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12 2 2 i'' ' ' ■ ^™ 

1 2 4 3 5 1 6 7 8 la XI 12 ' 9 (DATA CABD) 

(5X,F5.0.10X.F5.0,15X,F5/0.5X,P5.0/20X,F5.0,35X, • ^^°^Lnc\ 

F5.0/lOX,F5.0.1OX.2F5.0,I0X.F5.O.lOX»F.5.0//3OX,F5.O) CARDS) 

Figure 9. Control cards with FORMAT for variables 

in cjflirrent analysis ' 

^ parameter 3 equal .to parameter l^TV This illustrates the point 

that th^ number ofv variables accounted -^or by the FORMAT should^ 
be the value of parameter ^.^^ Note alfeo, that the variable se- 
quence numbers.-^so reflect (the secjuence nuiriber of each van- 
able whic^i is /established by the FORMAT. CARD. ; ' * ^ 

I,f the data were unformatted, the first alternative- is ap- 
'propriate/ exc^t that parameter 4 on the RU?3 CARD should be 
zero (of blank )\ and the FORMAT CARD should be omitted. 




System Information Cards 

The system cards required vary among installations. The 
program' described here was written and tested qn thte -AMDAHL 
system at the Instructional and Research Computing Center, Ohio 
* State University. The AMDAHL works lik6 an IBM' ihachine in most 
important respects, including the main features^bf JOB CONTROL 
LANGUAGE (JCL) used by IBM. The, description of system cards 
' given here is c-nf insd' to the required JCL; it is assumed that 
users know or > ve access tq general rules for JCL. Users of 
systems not accepting JCt must consult with personnel of thexr 
facilities to -learn about the required system cards. 

A schematic view of the type and placement of JO^COnAol 
ca*ds is shown below. 

JOB statement 
EXEC statement 

DATA DEFINITION statement for FORTRAN program 

(FORTRAN program deck » followed by a ' . 

card with /* in cols. 1 and 2) 

DATA DEFINITION statement for input data 

[input data, if on cards, followed 
by a card with /* in cols. 1 and 2] 

DATA DEFINITION statement for program cfONTSOL CARDS 
(Program CONTROL CARDS, followai by a card 
witil /* i». cols. 1 and 2] " 

^. • . . . ■ 
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If the data are'On cards the last twp DATA DEFINITION (DD)- c^irds 
can be in reverse order. The information not in brackets is JCL; 
the bracketed information is not JCL. The JCL cards are describ- 
ed below in the order listed. ^ 

The job statement takes the following form 

^ 

//jobname JOB operands . 

J' 

The double slash always appears in the first two columns of all 
job control cards; no spaces may, appear .except ^here shown. ,'The^ 
jobname is a one-to-eight character alphameric name generally 
selected by the user. It must begin with an alphabetic character 
or $. ■ Some installations may supply the job card, or the -first 
card of two or n^re .cards required to complete the job statement. 
(When^A JCL statement spans more than one card, the last char- 
acter *on all but ^he last card of the statement is» a comma. y 
.The operation "JOB" must be punched as is. The operands vary 
according to the job and installation. 

The execute (EXECf statement takes, the following form 

//stepname EXEC procname, operands 

The stepname is an alj)hameric name of up to eight characters 
supplied by the user. It may be omitted, but one or more blanks 
must separate the // from the EXEC if the- stepname is omitted. 
The EXEC must be punched as is. The procname- refers to the 
name of a catalogued procedure supplying frCL for the PORffRAN 
program. 'Procedures may vary among installations- Operands 
are optional and generally can be* omitted. An example with 
stepname STEPl, procname FORTRUN, and n9 operands follows: 

//STEPl EXEC FORTRUN 

The DATA PEFINITION statement for the FORTRAN d^ck takes 
the following -form: - 

//ddname DD * 

It tells the computer, that the FORTRAN deck follows. An example 
using FORTRUN follows . ^ 

//STEPl EXEC FORTRUN " . - ' 

//CMP. SYS IN DD * , , 

where the ddname is CMP.SYSIN. ^ 

The DD statement tor^ the input data takes the following 
form if the data are on cardfe: 
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where ddnaifrS is the/ data-definition statement name and depends 
on the catalogued procedure. For FORTRUN, the following state' 
ment is correct: 

//GO.FTlXFOOl DD 

The eleven fpllo\^ing FT designates the unit number and should 
match parameter 4 on the RUN CiHRP. It need not be eleven, so 
-long as it matches parameter 4 /and ip not the same as unit num- 
bers designated /for READ, WRI7E, ernd punch by the catalogued 
procedure. Uni^s 5, 6^. and 7^ respectively, are^ typically re- 

• served for these operations. - - 

If the Aafa. are. stored on tape or. disc, the * following the 
"DD" must be replaced with information describing the location, 
of the data, ^uch as the data set name (DSN) and DCB informa- 
tion. The variety pf such information ia too great to perrait 
description hbre. Jii order to create the tape or disc file", 
the user musu know the required information th§t can be used to 
aceess the data by t^is program, . , • 

The DD /statement for the control cards has the *same gen- 
er|il form a4 that for input data on cards. This general form 
;typically assumes the specific form of 

• ' / • ' 
//GO.SYSIN /DD * or * 

.//SYSIN DD* * . . 

. v ^ - • • 

A coniplete run using the FORTRUN procedure and \mit 11 for 
input dat4 on cards is illustrated below. 

^ 4 

//AlfiOO Jibs REGION=200K,TIME=a ^, » 

//STJSPi ^XEC FORTRU^I ' • 

//CMP. SYSIN "DD * . . ' . 

FORTRAN prbgram • ' 

/*''./■' ■ ' ' 

//GO.FTJlPOOl DD * ' . . . . 

.' d^ta cards • , • ' 

/* • ■ . : 

'//GO^XSIN DD * . ^' * K 

Q0NTROL CARDS • . ^ \ ' 

y* • . .* ' > 

The double 'slasf) in the first two columhs^of the last card des- , 
ignates the end-of the job. Note ^that t' jobnatee is AlOOO', *and 
tvO( opef ands^ are {Specified, indicating <. K bytses of storage 
and a one-minute time limit for the job. 

It should be noted that the. program assumes that xiiiii. 5' is 
reserved for rea^ing/control cards'-and unit 6 for printing. If 



the catalogued procedure makes- a different allocation of unit 
numbers, the following DD statement should follow immediately 
the program deck: ' ^ 

^ • * . 

y/step.FT06F001 DD?* SYSOUT=A, DCB«RECFM«FA ^ 

and* the following carcT'should precede the ^CONTROL CU^RDS: 

//step.FTOSFOOl DP * ' . ^ 

.» ■ 
where step refers to a step name (elg., GO) within the catalogued 

pxfOcedure^. ' 

'v •■ 
Program Output 

There! are four types of output generated by^the programs, 
each type beginn|.ng on a new page of the printout . These four 
types are: (a) a record Of the inforn^ation contained on the 
CONTROL CARDS 'and default valiies , (b) univariat'i and bivaAate ' 
statistics, (c) OLS multiple regression statistics, and (d) es- 
timates of parameters of the differential ,equai:ion. These four 
types of output are described briefly below. 

• o 

Record of control cards 

— ... 

- A sample of this output is shown below. The output is 
labeled with terminology closely matching that used in this 
section. Most of the output is self explanatory due to the 



RUN-CARO PARAMETERS 


NV = 


8 


NIV « 


2 


NR » 


8 


ITYP = 


0 


NU « 


12 . 


T » 


12.0 


MIS « 


• 10006 ^-lO 


VARIABLS 


SEQUENC5 




NUMBER 


1' 


I . 


2 


2 


' I 


3 






5 




6 


6 


7 






? • 







V 



labels. Parameters of the. RUN'CARD are clearly labeled. ' Wh^n 
default values are used by the program, these are printed. ' Note 



% 

that information on the DATA CARD{s) is given. linde^ .the column 
titled "SEQUENCE NUMBER." The cprrespondin*?' Prd©r of the vari- 
able used in the current run is printed in the column just, tb' • 
the left, labeled "VARIABLE ORDER. The FORMAT CARD(s) is (are) 
printed exactly as' it (they) 'appear in the input. In the example, 
there is no. FOR^T card. . , . _ . _ 

* t' ' . • ' . 

tjnivariate and bjvariate statistics 

Univariate statistics are calcul^te^l for each variable ijy 
"including all data present for that variable, snd^bivariate 
statistics are calculated by irt^lu^ing all ca^es for which data 
are present on both variables in the pair. Thus the sdttiple size 
and univariate statistics may di^fef for a given variable de- . 
•pending on the Ither variable with>hich <t is paired. Conse- 
quentlij, the format for. writi^ig the univariate and bivariate 
statistics allows for a different sample size, and univariate 
statistic? for each variable pair. The following informatror 
is .printed for each variable pair. A sample of. the output is 
reproduced on the following page. 



Column 
Heading 



N(I,J) 
XBR(I) 
XBR{J) 
SD(I) 

SDci) * 

COV{I,J) 



R(I,J) 



Content 

Variable order number of first variable in 

the paiir ' . ■ wt « V« 

Variable order number of second variable m 

the pair i i^- ^ 

Number of observations present for variable 

pair (I, J) • . , • 

Mean of variable I when paired with variable, 

J ... 

Mean of variably J when paired with variable 

Standard devia .ion of' variable I when paired 
> with- variable J ' ■ • a 

Standard deviation of variable J when paired 

with variable I , . . , 

Covariance between variable I and varianie 
J using all cases in which information is 
present for both variaisles, the number of 
such cases being N(I,J) ' £• v^i 

Correlation between variable I and vafiat)ie 
J using all cases in which information is . 
present for both variables, . R(I ,J)=COVU fJ; / 
(St)(I)*SD{J)) 



on 



o 
o 



UMIVARI^TE *N0 STATISTICSt N « 

I J' NIX.JI JCBRfl) XRRCJI 

6370*2 
-90.33*65 
-80.33*65 
5b. 25*5* 
55.25*5* 
55.?'^*5* 
-10.07* >7 
• -10. 07.97 
-10.07*«^7 
-10,07*97 
12«,5603 
129,5608 
129.5508 
l?o.5f05 
129,5608 
55.^33*^3 
55.e3?53 
55,83353 
55. S 335 3 
5S^3B53 
IT?. 83353 
-9.3B0(S58 
-9,3RD65S 
-9,390659 
-9.3'?n659 
-9.38065 8 
^9,39065*5 
-9.3fl0655 
162,3972 
162.3972 
162,3V72 
162.397 2 
162,3972 
162,3972 
162,3972 
162,3972 
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t 


inn 


?7042 




€ 


X V 


£ 




1 nfS 




1 

» 


-a 


\ m 
I \ f u 




C 




i U V' 












1 
1 




1 rtn " 

1 Uvf 


2 




1 on 




4 


inn" 


rX0.07<»97 


4 


4 


1 
i 




. f nn 


— 3 f [JH f 






1 nn 


o f\ !I^/A.C 




c 




^ ^ m '» *• T *t 






1 00 




5 


i.)n 


12-».56')8 


1 


6 




-A.*370*2 


2 


6 


100 


-fln, 33*65 


3 


6 


100 


55. 2*; -^5* 




<S 


100 


-10.07'.';7 

1 7v.5fcna 




6 


1 >o 


6 


6 


100 


•^••S. R5^S3 


i 


7 


iOO 


-*.*370*2 

-sn,3?*65 

5'^. 2'^*'.* 




7 


\oo 




7 


100 




7 


IPO 


-10.07^97 


• 5 


7 


100 


179.5608 
&5. 83353 
-o.3P065^ 


ft 


7 


loo 


7 


7 


100 


1 


3 


100 


-* .*370*2 


3 


6 


rno 


-^n,3?*ft5 




IOC 


5-5, ?;:i54 
-1 Ot07<.«>7 
1^9. 560 «i 






100 








6 


3' 


JOO 


5 5,i53 35 3 


7 


8 


100 


-9.380659 


8 


8 


100 


162.3972 



100 

SOfll > 

391 .'SOI 1 
391 .«<01 1 
2*3,*670 
391, "Oil 
2*3.*670 
8.1 1876 5 
39l.«<01 I 
2*3,*670 
8,in7t.5 
4*. 07031 
3V1.801 1 
2*?.*fc70 
8.1 !'?76 5 
A*,O7031 
*00.917I 
39UB01 1 
2*3.*670 
8.118765 
A*. 07031 
A0n.«;i7 1 
5.3B371* 
3Ol,f>01 1 
2*3.*670 
8.118765 
<»*. 07031 
400,9171 
5,3e37l* 
17.85116 
391 .801 1 
2*3.4670 
8.118765 
44.07031 
400.9171 
5.383714 
17.85116 
156.4833 



S0{ Jl 

391.8011 

2*3 .*670 

2*3,*670 

e.n'»765 

8.116765 

8,116765 

44.07031 

4*. 07031 

44.07031 

44.07031 

4fi0.9l71 

400,9171 

400.9171 

400.9171 

400,917 1 

5.3b3714 

5,383714 

5,393714 

5.3?3'»14 

5.393714 

5.383714 

!7,«»5U6 V 

17.S5116 

17.85116 

17,15116 

17.85116 

17,85116 

17.85116 

156.*'833 

156.4533 

156.4833 

156.4833 

156.4833 

156*4833 

156.4833 

156,4833 



CQVII ,J) 

, 153508.1 
-6219*1.64 
59276.20 
-IA2.074«} 
-21,7';?62 
65.<»1*34 
-9090,164 
1559,937 
129,?071 
1942 .192 
3301'^. 69 
1034-',66 
•?35. .^142 
-392'. 602 
1617 3*. 5 
-691.7J46 
350. ?!22 

21,7?3'0 
133.*654 

-?O5.«^6?0 
28.95 436 
81,71739 

-1720.329 
2,503508 
42*, 0796 

-3469.643 
14,36941 
318.6638 
14095.62 

-402T.212 

.26U060'5 

^638. 1230 
#9678.15 

-130.622? 

-607#0796 
24497.04 



I. 000000 
-.6520*12 

l.OOOOOO 
-.4*947340-01 
-.11035290-01 

l.OCOOOO 
-.5264537 

.1*53'i55 

.3627959 

1.000000 

.2159138 

.105«69? 

.2566288 
-.2164074 

l.ooonno 

-,42?74«f9 

.2740*96 

,4970003 

.5633465 
-.1429573 

1.000000 

.llf 3375D-01 
-.3955266 

.1 72739-fiO-Ol 

.5390571 
-.4848012 

. 149*170 

1.000000 

.2297*30 
-.1057052, 

•2054965 ■ 
-•92531640-01 

.7918495 
-.1550487 
-.2173257 

l.OOOOOO 
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OLS multiple regressiqin / > . 

The OLS regression yields the ma'tJ^ix P =IA*,- B*], in. stan- 
dardized and urtstandardized form., It also calculates multiple 
correlations. "The output iS labeled in a fairly clear manner. 
The first line of output includes the total sample size (N) the 
decrees of freedom for riumerator and denominator of F ratios 
associated. A^ith each regression and error code (lER) . If lER ^ 
0/calculaa.ons proceeded nomally.' If lER =^-1, .matriae xrtver- 
sion did nbt occur, possibly btecause the correlation. jnatrix, is « 
not positi/ve definite/ If IER> 0, e3«cessive round^^ng error 
.may have occurred. . ' . ' • 

The regression coefficients (standardized and unstandard- 
ized) are arranged so th'at dependent variables (DV) cross rows 
and independent variables cross col\imns, as labeled. The in- 
teger ndmbers 'to the left of each row and at ,the- top of each 
coiumn of these matrices'" give the variable sequence nuaiber (not 
the order number) . ft sample of the' output is shown on thp fol- 
lowing pagSi^ ^ ' • ' 

\ I '. ^ - ^ • 

DiSferenViai.' equations output 

There are three types, of ■ output related to the differential 
equation, eacb type beginning on a new page. First s the un- 
standar0i^ed matrix P is subdivided into its component parts 

A* and B*, 'and these componea^s, are printed without changing 
their valued.. A sample of the output is reproduced below. 



MATRIX Of A* COEFFICIENTS — 

0 .1 2 

6 40.759 -}.8&6i7DVfl3 .478270-02 

7 ■ n:220 *^ '.loHeO^l . -.228720-01 

8 185.67 -.272«»70-01 ^.-.16887 

MATRIX OF B* COEFFICIENTS — " ' 

^3 4 5 

6 ' .29262 ,360380-01 -.270810-02 

7 -tillia. .28055 r.UO2^pO-0l 

8 -l-,3928f .44887 .34361 



The matrices aV and B* are labeled in. th§ o"^?^?> Again, 
dependent variables .eross"rcrrfs and independent variables (x for 
A* and for B*) cm^s coluWs. The integer numbers beside 
I- each row°and^above eadh columk identify the variable sequem^e 
number. The "0" above columnjpne of A* designates the inter- ^ 



cepts 

l6l 



OUS MJLTIPtE RECESSION * 

N « 100 MUMERATOR OF * . 5 DENOMINATOR OP 

STanOaRIZcO RECiRESSION C06FFIC lENYS— ^ 



OV 

6 
7 

8 



XNDEPeNDENT VaRlA-BLES — 

I 2 3 

-O.Oft^O^i 0.?X5?9 O.A^!?8 

o.r^Ai^ -o.-in9<» -o,i<»607 

-rf^.06P3'i -0.26273 " -0.0722$ 



f^94 RETURN CODE FROM INVSt IBR • 



ii&STANOAROIZeo RECRtSSlON COEFFICIENTS-- 



DV 

6 
7 
8 



. INTERCEPT 

0 

4n,75«J 
1 1 ,220 

MULTIPLE R — 



0.70729 



inOependent variables — 



-.8ftA172-»03 
.10fe7Hn-ni 
-.272070-Oi 



V 7 
0.80607 



.^73270-02 
^^.228720-01 
-.16 837 



8 ' 
0.82621 



0,29500 
0.69262 

(JiI26«2 



•29262 
-.32118 
-I .3928 



\ 



-0.20167 
-0.31<»<J7 
0.88035 



.360390-01 

.28055 

.'5i4!?e7 



-.270810-02 
1A0240-O1 
.34361 
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Tne eigenvalues and eigenvectors of B* comprise-^he second 
pf output related t to the differential eqlaation. A sample 
shown bfeloW. 



^type pf output 
of the output is 



REAL PARt OF EIGENVALUES- OF 

1 ' 2 
.265<>5 .26545 



.38588 

IMAGINARV PART OF EIGENVALUES OF 3* — 



EIGENVECTORS OF.B*^ 
IF ADJACENT VECTORS 
COLUMN CONTAINS THE 
CONTIANS THE IMAGINARY 

RIGHT EIGENVECTORS 



-.13346 
FOLLO>J. 



THEY ARE STORED SUCH THAT 

TK;K*n ARE conjw;ate p*3S^j J^L,'^^" 
reaLpart, and the {<*x)th column 



1 

1 ,174340-01 

2 -.1?P'>40-01 

3 .22198 

LEFT EIGENVECTORS 
I 



1 
2 
3 



18,568 
-2.1149 
,64275 



.46?'?20-02 
,677870-01 
.893l<»D-0l 



-2.1615 
-6.9361 
-.22190 



-.321090-01 
-.177950-01 
.B6900 



-10.358 
-.29661 
.76196 



The first row of output displays the real part of the eigenvalues 
of B*, and, the second row displays ^e imagin^y component of 
the^corresponding eigenvalues, % 

Let A be a diagonal matrix with diagonal elements equal to 
the eigenvalues of B*, and let V be the matrix whose columns 
are eigenvectors associated with A, then . 

B*V = VA 

Assuming B* is diagonalizable , V is the matrix of right eigen- 
vectors ol B*, and (V-i) ' is the matrix whose columns are com- 
posed of thi left eigenvectors of B* . The output labeled RIGHT 
EIgInvECTOR^ gives v! and the output labeled LEFT EIGENVECTORS , 

gives (V • . 

The estimates of the parameter matrices of the differential 

equation (28) (I and B) comprise* the third type of output re- 
lat-ed to the dilferential equation. A sample of the output xs 
reproduced below. 
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INTERCEPTS £ COEFFICIENTS OF PREDETERMINED VAR1A8LES» A — 

> ' . 

Tl' .INTERCEPT INDEPENDENT VARIABLES — 

' 1 0 I .2 

* *<S8.49Z -.2*7310-02 .9*8020-02 • 

- 2 n-*"''' .163l?D-ni -.309120-01 

8 375.*^ T. 577820-01 -.23550 

COEFFICIENTS OF ENDOGENOUS VARIABLES, B — 

Tl • TO. VARIABLES ' 

3 * 9-5 

6 -Jiljp .12*26 -.556390-02 

• Z ''l*?5?ir -1.1618 -.*6798D-0l 

8 -3,*3lp 1.6079 , -1.0*61 

■ ■ . \ ■ . • ■ 

The format of the output follows precisely the format for 
printing A* and B * . The intercept and exogenous variables (x) 
'ap|5ear ^cros^ the columns of A. The t^ (TO) endogenous vari- 
ables appear across the columns of and t, values vary, 
across rows^r as labeled. 



Data Transformation 

A s\ibroutine named DATA is called after each line of dafta 
is rea^. This subroutine can be used to carry oub dat^ trans- 
formations. The general use of^ the subroutine is descril^d be- 
low, and an illustration is presented in which varying missing 
data codes are receded to a single value - 

If no data tr^s format iq;TS~are desired, the subroutine is 
defined as follows: 



SUBROUTINE DATA(X,R,M,IPK) 
REAL X(l) , R{1) 
INGEGER*2 JPK{1) 
DO 1 J ^ 1, M 

X(J) = R(IPK(J)) 
CONTINUE 
RETURN V 



END 



This version of DATA -is included in the standard version of the 
program. package , and nothing need be added if the user desires 
no data transformations. The variables are defined as follows: 

X =.a one-dimensional array which receives the values of 
the variables to be used in -the current analysis 
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R = a one-dimensional array which contains values of all 
variables in the data set, or all variables accounted 
for by the FORMAT statement 

M a scalar giving the number of var'iables to be used 
in the current analysis . . 

IPK = a one-dimensional array (length two integer) whose 

elements contain the sequence numbers Qf the variables 
to be used in the current analysis. These ar^ listed 
in tl^ order' in which they appear in the analysis on 
the RUN CARD 

The values of R, M, and >IPK are passed to DATA by the subprogram 
which accumulhtcs sums for. the correlation calculations j the 
variable X is i»teturned • to the calling -prog r\im. ^ 

If data transformations arV desired, one may replace the 
above version of DATA with a user-written version. The user- 
written routfine must contain the same variables in the calling 
list that are shown above (i/e., X, R, M, IPK) and must define 
each element of X. An exampWf a possible user-written ver- 
sion with six- variables in the current analysis is shown below. 
The example recodes a vg^riety of missing data codes to the 
standard value of 10^ (1.E9). ' ^ . • 

SUBROUT I NE^- DAT A ( X , R , M , I PK ) 

REAL X(l), R(l), XMIS(6)/9.,99.,9.,9.,9.,999./ 
INTEGER*2 IPK(l) ^ 
DO 1 J=1,M \ ■ • 

X(J) = R(IPK(J)) 
^1 IF(X(J) .EQ.XMIS(J) ) X(J)=1.E9 

RETURN . ^. • . > 

END , ^ , • 

Note that the array XMIS contains the missing data codete for 
• each X value. These -are defined by the declaration statement: 
REAL Xfl), R{1), XMIS(6)/9. ,99. ,9. ,9. ,9. ,999./. ^ 
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* '' CHAPTER 5 . ' 

ADDITIONAL INTERPRETATIONS OF CHANGE COEFFICIENTS 

' The material in the preceding chapter sujE^plies the back- 
ground necessary for a more thorough examinat»ion of substantive 
interpretations of differential equation systems than could be 
presented in chapter 2. There are four topics to be examined. 

r First, interpretations of the coefficients of ^the differential 

equation (32) are compared to interpretat5»ns *of the correspond- 
ing integral equation (33) . Secondly, the time path of the 
career expectation variables implied by the differential equa- 
/*tions is examined. This discussion includes consideration' of 
equilibtium and oscij-lating systems, "thirdly, a method of 
standardizing the coefficients of the differential equation sys- 
tem is presented and the advantages and disadvantages of inter- 
preting the , standardized coefficients are discussed. Finally, 

A. a brief preserttation discusses a generalized correlation for 
^ assessing accuracy of forecasts. 

Coefficients of the Differential Equation and 
Cross-Lagged Regression Coefficients 

. . The OLS estimates of A* and B* in the integral equation (33) 

are cross-lagged regression coefficients. It has been suggested 
„ that cross-lagged coefficients be used to assess the relative 

effects of one variable on another (e.g., Heise, 1970). In the 
present context, interpretation of effects by reference to the 
parameters A and B of the differential equation system (32> is 
•an obvious alternative. As noted in chapter 2, conclusions 
about causal relationships may depend on which coefficient 
rttatrices are used in the interpretations. 

>»■ 

To facilitate the discussion, some of the basic relation- 
ships are reproduced here, and the notation is altered slightly. 

(^0) £ = A^ + By^ + u (differential equation) 

(51) 'j^^ = A*(t)x + B*(t)j^ + u*(t) (integral equation) 

where 

X = (1 + L) X 1 vector of exogenous variables 
y = K it 1 vector of endogenous variables 
= y at time t 



u 



K X 1 vector of disturbances 
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u*(t) = K X i vectpr of d^isturbances y. r .( 

A « K X (1 + L) matrix of coefficients of the enogenou& 
variable-s • 

B = K X K matrix of coefficients of the endogenous 
~ variables ^ ^ ' * 

A*(t) = K X (2 + L) matrix of coefficients of the exogen- 
~ ous variables for a discrete interval of time 

equal tb . 

B*{t) --^ X" K -matrix of coefficients of the endogenous 
variables for a discrete interval of time equal 

*^ to t s , " ' , ■ 

The matrices A*(t), {t) , and the vector ^ (t) are denoted here 
explicitly as~f unctions of the time interval t between ' 
t = 0 and t^ = t; the fact that these elements depend on time 

is an important aspect of their interpretation.' • The following 
relationships hold between A, B and A* (t) , E[* (t) . 

. Bt 
e— 

r 

[In B*{t) 1/t „ 

t 

(.e^^ - I)b"'^A,^ if |B| ?^ 0 
B{e-^ - I)'^A*(t), if |Bi 0 ^ 

Bt * -^v^^^ + fV,V^^U"^A*{t) , 




{52a) 


B* 


(t) = 


(52b) 




B 


(53a) 


A* 


(t) - 


(53b) 




A = 


(54a] 


A* 


(t) = 


(54fo) 




A = 



t 



A = [ (e-^ - VliLi 1 ^ ^'Iz^ 



if IbI = 0 



In a sense, the coefficients in A and B are fundamental to 
the theory because they express the instantaneous impact of x and 
y on changes in i over time. Also, A and B are simple to inter- 
•pret because they are constant over time. In contrast,. as re- 
vealed by equations (52a)' and (53a) or (543), A* and B* are 
•matrix functions involving time. Consequently, simplistic-com- 
parisons - among entries of A* -and B* to index relative effects of 
different variables shouldlje avoided. To see why, ex&mine the 

relation B* {t) = e- ; this implies that 

(54) \*it) ='l.B*(l)3^ ;0 
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where B*(i) is the coefficient matrix of the integral equation 
over a single tinie period, say one year. Thus, for example, 
the matrix of cross-lagged regression ^coef f icients over' a two- 
year interval is just the square oi.»the matrix for a one-year 
interval. From this fact alone it 'can be seen that €114 elements 
of B*(t) for different t do not bearra s*imple relationship to 
eacH other . 

Consider a four-variable example with the cross-lagged co- 
efficients for a one year interval shown in the following matrix 



B*(l) = 



.6 


.4 


:5\ 


.13.\ 


.2 


.8 


.11 


.4 


.05 


' .1 


.9 ' 


.5 


.3 


. .01 


.4 


-7 1 



if the same system were studied over a two-year interval, the 
pross-lagged regression! would be 



.611 


.846 


.579 


.735 


.447 


.681 


.195 


1.046 


.84.7 


.175 


.^91 


.733 



B*(2) = IB*(1)]^ = /.504 

.^06 
.245 
\.412 

Clearly, interpretations drawn f rom^ B* (1)" would differ from 
those based on B*(2). First, ob^erving^'^d) , one would con- 
clude that the effects of the lagged value of each endogenous 
variable on its cur4-ent value aominate the system because in 
every casei the di&gonal elements of B*"(l) are substantially 
greater than the off-diagonal element-s. This observation does 
not hold for B*(2), however. In fact, observing B*(2), one 
might^^be impressed by the fact that "effects" othejf than- those 
of tnfe lagged endogenous variables on themselves are so .strong. 
Also, the relative magnitudes of the coefficients have shifted, 
the most dramatic- example being the change in relative magnitude 

(4, 1) elements: for the one-year interval, 
b*^(l) = .3, the difference being ;i'3 - .3 = 

-year interval, the direction of th^ difference 
= .579 1 and ^^-^W = .412,%^o that the dif- 

- .412 = +.167.. In the first -case, one 
t-he effect of variable 1 on variable 4 far 
effect; whereas, in the second case just the 
is 'suggeste.d^ Numerous other changes in the 
of the coefficients can be observed when ' 
B*(2), but most are substantively inconse- 



(1, 4) 
= .13, 



and 
and 

tw 



b*4(2) 



of the 
b,J(X) 

17 . ^ For the 
has changed, 

ference now is .579 
would conclude that 
exceeds the reverse 
opposite conclusion 
relative magnitudes 
B*(l) to 



comparing 



quential. One interpretation of the coefficients in B*.(t) is 
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that they index accumulated tot^al effects — indirect plus di- 
rect effects — over the time interval. , 

--^Observing equation (5?a) or {54a) it is easy to surmise that 
the conclusions regarding interpretation of B*(t) extend to in- 
ter pre tat ioa^^ A*(t), For the sake of brevity-, however, no 
example is presented for A* (t) . 

Tfae main conclusion to be drawn from this discussion Is n'ot 
necessarily that the coefficient matrices of the integral equa- 
tion hold no interpretative va.lue-. Rather, interpretations - 
should proceed' with caution in full knowledge of the manner, in 
which A* (t) or B* (t) depend on time. One useful interpretation 
of these matrices is as -change coeffieients of a difference e'qua-. 
tion over a finite time interval, t. ♦Tp motivatq " this inter- 
pretation, subtract y© from both sides*^f equation (51) : 



(55^ y - y^ =^ A*(t)x -f tB*rt) -Ti^o u*{t) - ' v ~ 

The left side of this result is a change vector in ^ t^Y ^t " ^o^ 
ov<2r a finite time interval, ,t. It can be seen from the rigjht • ^ 
side of the equation/ that the cofef f icients of x and of f-diagonalj / 
coeff iq4ents of ^ are precisely' those of the Integral equation , 

(51) . The diagonal coefficients in the change equatipn can be 
calculated simply from the diagonal coefficients in the integral 
equation. If the time interval t is taken to. the- limit of zero ' » 
and division by dt is effected, equation (55) reduces tO| the 
differential equation (as required by. all solutions 'to •differen- 
tial equations). Thus, one view^ of the differential equatiory i? 
that it is a special case pf tl^ difference equation. The ques- 
tion' therefore may be raised i^why considet the coefficients of 
the differential equation more fundamental than those of the . . 
integral equation? The answer >rests with the initial hypothesis 
that effects are instantaneouB. . If • one doubts the hypothesis of 
instantaneous effects, then the parameters of the differential 
equation might not be viewed as fundamental. Even if the hypoth- 
esis of instantaneous effects is not tenable, 'however, it does 
not necessarily follow that A*tt) and B*<t) for some unknown t 
give the fundamental paraijaeters .©f the system. A viable alter- 
native may be to alter the functional form of the differential 
equation system, thus retai'hing the conceptual advantages of 
continuous rather than discrete time. r 
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Time Path of Career Expectations ^^^^ ^ 

Omitting the disturbance term from the integral equation 
(51) leaves an equation that can be used to forecast the values 
of all career expectation variables and^ all individuals atiaqy 
point in time: * - . 
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(56) « A*{t)x + B^it\y^ 

where is the vector of forecasts. ' The. time paths of the 
elements of ^.^ expresses the 46velopine;it over time of career ex- 
pectations predicted by the theory; the patterns associated with 
u*(t) are random and not accounted for by the theor'y.. The pur- 
pose of this section is to examine the behavior of the predicted 
vector over -time. 

The behavior of the 'time path of predictions can be studied 
readily if one substitutes the expressions for A*(t) and B^{t) 
in terrts of A and B into (56) : " , . 

(56b) « rB'"'"Ax + e^V^Ax + e^^y;^ |B|5^ 0 
These results hold if B is full, rank; if B is not full rank: 

(56c) = Ke^^ - I)VjlA^^V^^' + t-V2V*2MAx + e^V , 

|B|= 0 ^ 

(56d) £^ ^-(V^Ai^V^'^^Ax + t-VjV^^^Ax) 

+ e^^(V^A^^^^^)Ax + e^^y^, |b| - 0 

where, as before, A. is the submatrix of A containing nonzero 

il) * -1 

roots, Vj^f V' are the submatrices of V, V , respectively, 

* 12) * -1 

associated with A^, and V2 ' sre the submatrices of V, V , 

respectively, associated wiih the zero roots of B. 

It is clear from equations (56a) through (56d) that the 

time path of y^ depends in an important way on the behavior of 

Bt Bt 
e~ over time. If e- grows without bound, so does y, , ir- , 

respective of whether B is full rank. If e~ stabilizes or goes 

— A 

to zero over time, the predicted vector of career expectations y^ 

• ■ . ■ /■■"■' 

110 • 



stabilizes, if B is full rank (jBl ^ Q) * In gensrai,^^ grows 
llneairly wiLlWiit bound if<B is singular (|b| « 0). This is clears . 
by observing the term in (f6d)^which is linear in tv (i.e., ^ 
f^V V^^^)AX) . If all eigenvalues and eigenvectors of B are 
real^numbers, then the following matrix representatiion i^useful:.( 



«Bt ^ „^At„-l 
e~ Ve— V 



■ s- ft , 

where A is 'the diagonal matrix with' diagonal entries J^^f^^^^ 
eigenvalues of B, and V is the^. cor responding matrix cTf ^ig^- 
vectors. This fessalt does not -depend on A being real', o«t , 
- does not yield so^much insight when i.is complex. When 

eigenvalues (A) are r^l numbers, it'^is clear that^e»^ stabilizes 
:.__oiily if A ± 0. : . 

* ^or general^ matric?es (real or complex eigenvalues) , 'an al- 

ternative development »s useful. \ For this d€velo,^ment, the ^ . 
elements of e^^- aVg^ denoted by bi j . A, tedio.us' algebraic -ar^ju- 
ment " leads to the fallowing result. r-^- • , 

• (57) bj. - ^lYijj^e^ sin(Xj,^t + 6.j-^)^ . t . 

^ 

where ' ' . ^ a ^ ^ 

Xj^ = the kth eigenvalue of B and, in general, is a complex 
' * number; Xj^/= X^^^ '+ X^^i, with = -1 . 



V . , V,., = The i,kth seal and imaginary parts ,. respectively ; 
Rik Ilk of the i,kth c^^l in V (;>right eigenvectors of m 

kj y kj ^^he k,>th real and imaginary parts, respectively, 
R ^ . 'of the k,jth cell in V"! (left eigenvectors of B) 

^ * ■ • 

0 ot'herwise 
kj 



' kj 



Ill 



• 1 
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The' formula given fojr-6.., does ?.)Ot define 6'.., if v-..^v 4- 
^ i^k ijk lik R 

^Rik^l^^ =0, but fi^jj^ goes .to tlT/2 as this expression approaches 

zero. Hence, will be defined aa tTl, it v^^^y^^ 

^Rik^I ^^ 5' ^k' ^® real, then 

A,, , and vj^ are all zero, and one finds 
Ik I 



. ^ ^ijk - "^Rik^R 



}for Xj^ a real nuimber 



sq that, for \^ real one has Y^^^j^® sin (X^j^t + ^ij^j^ 
« Xrij^^r^^®^^^^' " I'f ail roots of B are real, (57f specialises . ' 

to ' . 

• X t ki • ' ' 

,.' (S3a> b*. = &v.,e k V ^, if all X are real 

i3 k - ' . 

Observing (57a) , it is clear that if the maximum X^^^ (A real) is 

positive, every element of e^*^ « 1*^*^) grows exponentially with- 
out bound. If the maximum X. is negative, each element pf 

e!^^' goes to zero' by arfi exponential decay. If maximum Xj^ ife zero, 

the elements of e- stabilize over time. Referring to (57) , 
these' conclusions also apply if one or more is qCTnplex? here 

the critical variable is the real part of X. . Also, due to the 

. f- ^ Bt ^• 

sinusoidal functipn sin (X ^j^t + 5^^j^) , the entries of e- os- 
cillate over time . ^ — ■ •' 

""combining these ot>servations about ^e behavior of Jbhe 

elements of e-*^ with equations (56) permit some conclusions re- 
garding the tinle path of the career expectation variables: 

' I. If the real part of the largost. eigenvalue of B (X^^ roaJc) 

is negative, then from (561^ one sees that the equilib-^ ' 
rium of is ' ^ ' 

^ y.(equil.) = -(i^A)x ' " , 

li;2 * ■ ' ' 



For < 0, therefore individuals gradually revert 

R max 

to a set of career expectations determined by their / 
status origins and mental ability, tf is near 

zero, however, this may occur so slowly that, by adult- 
hood, career. expectati9ns may still be far from equil- 
ibrium. .. 

If the real part of the largest eigenvalue of B is 
positive, career expectations increase without ' bound . 
Again, if \^ is close to zero, the growth may be 

slow.Xn the short run. 

If the largest eigenvalue of B is precisely zero^'B is 
singular and one must reference equation (56c) or 
(^6d) . From either of these equations it* is clear that 
for max" ^' ^^^^^^ expectations, in general, grow 
linearly without bound, • 

If any of the roots (X^^) of B are complex, an oscil- 
lation is introduced into the time path of career ex- 
pectations. For example, the level of job status ex- 
pectations waxes and wanes over time. The amplitude 
of the oscillations depends on the size of the real 
part of the eigenvalue; the larger the real part of 
the complex' eigenvalues, the larger the amplitude. 

Irrespective of whether B is full rank, if Ax = 0, then 
the time path of is Yo • '^^^^ means that the time 

path of career expectations does not depend on the 
status origins and mental ability of the individual . 
This is not a general conclusion, however. For some 
matrices A there may be no vector x except x = Ojthat 
makes Ax 0. In fact. Ax = 0 x = 0 generally if 
there arp more endogenous (career expectation) vari^le 
than exogenous variables (e.g. , parental status vari- 
ables) . ' srince the fi-rst element of x is fixed at 1.0, 
X cannot be zero. If a nontrivial (nonzero) solution 
to Ax = 0 exists, it means that there -is at least one 
peculiar pattern of the status background and ability 
variables that frees the time path of career expecta- 
tions from status backgro md and ability. A particular 
ly interesting case arises when the maximvun is zero 

but its imaginary part is nonzero. In this instance, 
if Ax = O, then career expectations tend to a periodic 
functionT i.e., the career ^expectations wax and wane 
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indefinitely with a constant amplitude between the high 
and low extremes. It should be emphasized, however, 
that this can happen only for individuals whose ability 
and status background follow the pecuXiar pattern satis- 
fied by Ax = O, and the periodic pattern is impossii'ble 
unless A is such that a nbntrivial solution of Ax (= 0 
exists . 



Standardization Procedures 



Since few of the variables used in career-decision making 
research are measured on a "natural" scale, i.e., one that is 
universally familiar, path analysis applications to career de- 
cisions fr€»quently are presented in standardized form, to fa- 
cilitate 'Comparisons among coefficients associated with vari- 
ables measured on different scales. Even when variables mea- 
sured on "natural" scales are the subject of study, standardised 
coefficients can fascilitate interpretations, either as a sup- 
plement for the unstandardized coefficients or as the primary 
fbcus of attention (Wright, 1960) . Consequently, it may be 
useful to present a standardization procedure for the differen- 
tial equation system. 



The standardization discussed in this section is confined 
to linear functions of single x and y- variables. Note that the 
usual standardization to zero mean and unit variance is such a 
linear transformation, but we wish to study more general stan- 
dardizations here. 

^ If one makes a linear transformation that is constant over 
time on eachf x and each y variable .to effect a standardization", 
the structure of the differential equation remains intact. To 
see this, define the following linear transformations on x and ^. 



^1 



C ) 

-y 



^1 



D (x 

-X - 



c ) 

~X 



where D is an arbitrary diagonal matrix of order K, D is an 
arbitrary diagonal matrix of .order L + l,'c_ and c , are re- 

^ y X 

spectively, K x 1 and (L + 1) x 1 vectors; D' ^ D , c , and c 

^ , ~y -X ' -y 

are constapt over time . Tn torms? of the original variables, 
the differential equation is written 

(62) d^/dt Ax 4- By + u 

Now, since and are constant over time, 
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= Dydy/dt 



Thus, , 



-1. 



D Ac + D, BD D (y + c *) 
— -y — ^y -y^^ ^—y' 



where u, = D u. Ndw, assume, that Jthe (1,1) element of is 

, . . y - J -.. - — n 

>riity and. the' first element o:5 c is zero, so that the first ^ " 

element t>f is also unity. Now, let the first column of 

^v/^^v fa® added, to r(D,Bc; \+ D Ac ) and denote the resulting- 
— y X — y — y — y — ^ * 

sum by a.. Denote the remaining L column^.of D AD by a. , and 
form the supermatrix A^ = [a^, a^] . THe differential equation 
system for the standardized variables can' now be written 

(63) d^^/dt « A^x^ + B^^^ . 

,; « 

with B, = D BD , This obviously has t;he same form as t62)-; 

y y .• 
hence, it is concluded that linear standardizations of single 
variables in the system preserve the structure of the system. . 
It should be emphasized, however, that this a^^^ument does not 
justify standardization of all variables in the regression 
analysis ,to zero' mean and unit variance. " The t^ and t^ en,do- " 

genous variables are both used in the regression 'analysis.' If 
both are standardized, the assumption that D and c are fixed 

-y . -y 

over time is violat;ed, since the mean and standard deviations 
of the Y. variables generally will shift over time, the and 

Cy applied to y values at t^ would differ from tliose for t^ . 



. 4 * 



A judicious choice of the standardization constants D„, 

~y 

I, and a is required in order to simplify interpretation 



I. 

of the standardized coefficients. 



For D and c , there is 

—X -X 



little difficulty. Since the mean and variance of the x vari- 
ables are fixed in the analysis > D and c can be defined by 



r 
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the means and ^andard deviations of x. Let the (1,1) element 
of D equal 1.0 ai^d the remQ,i*hing diagonal elements be defined 

—X . • r 

by i^he reciprocal of the standard deviations^ of x. Al^, let 
th< first element" of be zero and the remaining entries be 

defined by the negative of thje means of the corresponding xs. 
Then x^ is th<^ vector with element one equal to 1.0 and the re- 
maining elements equal, to standardized x variables with zero 
means and unit 'variances . 

For the endpgenous variables, choice of the standardization 
constants is somewhat more ambiguous than for the ekogenous var- 
iables. For the exogenous variables, the strong precedent of 
standardizing to -zero mean ancl unit variance was followed, but 
as noted above , this opj^^on is not available for the endogenous 
variables because it violates the presumption that and'c^ 

sare constant' over time. Nevertheless, a procedure- analogous to 
standardization, to zero means and unit variiances might be used. 
One migRt pick the means and standard deviations pf the y var- 
iables at a specified poin^ in time, say T, and use means and 
standard deviations calculated from the y variables at the se- 
lected time point. Thp"^ transformation constants would be 

~ ' ^^*^'?.y ~ "^^Lp^ » where is a diagonal matrix with 

diagonal elements equal to standard deviations of y at time T, 
and ETy^) is the mean (expected value) over observations of ^ 

at time T. 



The selection of T is arbitrary. If the system has a stable 
equilibrium, T may be set to infinity and the equilibrium means 
and standard deviations used ^o define the standar,dization con- 
stants. Interpretations 'could then be made in terms of movement 
toward the mean at equilibrium. If the system has no stable 

equilibrium, then one might set T = t and interpret changes 

, o 

• ^way from the mean at the starting point. Other options are, 
. of course, defensible. One may set T to the senior year in high 
school and interpret changes toward the mean during the senior 
year. 

» 

As with path, analysis, one should treat the standardization 
chiefly as a heuristic device — cjertainly, the relative magni- 
tudes of the standardized change (Efficients (A, B, ) can be 

shiHed at will by choice of the standardization constants. There 
^ is no clear reason other than precedent for using standard d^via- 
f tions rather than some other measure Ipf dispersion such as the 
average deviation, . empirical range, or permissible (theoretical) 
range of -each variable. Similarly, choice of the mean rather 
than some other measure of central tendency is based primarily 
on precedent (see Hotchkiss [1976] for an extended discussion of 
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this issue). The choice of c^, affects only^ the intercept, 
but the selection of D^, D affects the relative magnitudes of 
the change coefficients associated with the va^ables.. 

Forecasting ^nd Measures of Association 

The integral equation (51) can be used to estimate the par- 
ameters of the differentl^al equation, as described in the pre- 
ceding chapter, or, once the coefficients have been estimated, 
the integral equation can fcfe used to forecast or predict a 
value for each endogenous variable and_ each case, given initial 
observations on the vector of endogenous variables {y^) . -Equa- 
tion (56) is written explicitly as a forecasting equation. When 
OLS 1-r appiied- ro-'equatlon • (St) - a-imitiriTrl^rfDrr^-a^^ i|s 
sqif^re (R-square) for each equation isf" a standard measure of 
association arid is a normal part of the output of most ^multiple- 
regression computer programs. When the integral equation is 
used for forecasting, however, there is no standard measure ot 
association. The purpose of this section i^ to discuss a gen- 
eralized correlation that can be compared readily to correla- 
tions calculated from least-squares regression. 

As preparation for defining a goodness-of-f it measure to 
assess the accuracy of f'orecasts, it is useful to review the 
interpretation of R-square as ^ ,proportional-reduction-o£- 
error (PRE) measure. Let , " 

K 

y = Po + ^P.zv , j. 

. j = l-' 

where y is the dependent variable, the Zj are K independent 

variables, v is the error, .and the ps are constants. The fol- 
lowinq formula for R-square offers considerable heuristic appeal 

2 2 2 

•(^4) R ^ - 1 - s^/Sy 

where R^ .denotes the square of the multiple correlation ''{R-squar^) 
an^*6^ s^ indicate the variance of v and of y, respectively.. 
Th6 OLS estimates of the ps insure, that the mean of v « y ~ y is 
zero, where y is the value of y predicted from the regression 
equation. Hence, s^ is a variance of the errors of prediction 
from the linear regression: - ' ^ 
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2 * 2 

- E(y - y) , where E denotes expected value. 

The denominator of the ratio in equa^tion (64) is the var- 
iance of the dependent variable: 

= E(y ^ Ey)'' 

This variance can be interpreted as a measure of error in the 
absence of information about the independent vdriables, where 
the mean of y (Ey) is used as a constant predicting" the entire 
set of y values. Recall that, in fact, the mean of y is the 
best constant I. predi<^tor all ys, in Jihe sense that the mean- 
square-error <is miniifiized when th|E con'stant is the mean of tlie 
disbribution. * ° 



With this background , it is clear that the ratio %/®y ^s- 

a ratio of m^an-square errors — the numerator summarizing pre- 
diction errors froift the regression equation, and the denominator 
summarizing prediction errors when all values of y are . predicted 

2 2 ■ 
to be the mean of y. Hence can be viewed as ^kPRE measure 

1 2 2 

and,' ipso factor,^ so can R = 1 - When OLS estimates of 

2 " y 

the ps are used, the minimum R^, is zero and its jnaximum is one. 

A straightforward generalization of (64) provides a usefuO. 
statistic for summarizing the accuracy of forecasts. 



10. Let p^ be a constant over the y values and form the mean-^ 

square-error — MSE = E (y - p^) ^ . Differentiating with 

respect to p^, setting to zero and solving yields, 

p^ ~ Ey. The second derivative of M^E with respect to 

is the positive constant, 2^, Hence, a necessary 

" and sufficient condition for •a minimum is present when 
p^ - Ey. Alternatively, one might postulate P^ = Ey + 

any constant, and de'velop the following argument: ' . , 

; / ^ E (y - Ey + q) = E (y ~ Ey) + q 

Clearly, this expression is 'minimum if fnd only if 
q =f, p, so that p^ ~ Ey. 
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Define . . ■> 

4 

(65) 4c = - ^^^^l , 
where. R^^ denotes an R-square for forecasts", MSE stands for 
mean-square error, and s is the variance of the predicted endo- 
genous variable at time t; 

2 - 

MSE = Ety^ - y^) 



(Where y^^s the forecasted value of the scalar y^. ^It is 

important to distinguish y, the forecasted y, from the post 
facto estimated value of y from a regression analysis. Note 

that, in general E{y^ - y^) 5^ 0? therefore, MSE will not be a 

variance as it is in multiple regression.^ 

The maximum value of is 1.0 and occurs if and only if 

forecasts of y are correct for every observation. On the other 
'hand, the minimum Jl^^ is not zero; R^^ may be negative (indicat- 
ing R? should be interpreted as an imaginary number). When" 

used in cpn junction with non OLS regression estimates, R may 
also be negative, and it Has been objected to on these grounds 
(Fox, 1979: 145. 3a«eman, 1962). The . fact that 
R^^ may be negative, however, is not a strong objtection, since 
a negative R^^ has a straightforward*' interpretation? the nega- 
tive value indicates that more accurate estimates of all ys 
result when each y is estimated to be the mean of y ^than when 
the ys are forecast from the model. 

Numerous alternatives to (65) are available (see Fox, 1979; 
Ostrom, 1978). Of those proposed, one seems particularly ap- 
pealing. Ostrom (1968: 67) proposes that in (65) be replaced 
by the mean-square of the differences between t^ and t^ values 
of y. The implicit hypothesis of this mean-square-error is that 
thd y values, don • t -change ? a stable equilibrium has been reached. 
The resulting measure retains its PRE characteristic, since xt 
is based on a ratio of a MSE due 'to the model to a MSE derived 
from a "naive" model. A more general definition of 
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IS therefore suggested Csee Ostrom, 1978 i 68); 

^66) Rp^ - 1 - j^gg^ 

Where MSE(M) is the mean-square error for the theoretical model 

and MSE(N) is the mean-square error for a "naive" model (naive 

is Ostrom's term) . One important advantage of defining 

2 ' * 2 

MSE(N) - s^, as in (€5), is that R^,^ is more readily compared to 

R-square than with other definitions of MSE(N); nevertheless, 
a variety of MS£(N} might prove useful, depending on the cir- 
cumstances. 



The mean -square errors in (66) might be replaced by aver- 
age deviations. One advantage of so doing is that the error 
summaries are based on absolute values of errors, thus preserv- 
ing the -metric of th^ dependent variable rather than transform- 
ing that metric by squaring all errors. 

Reliance on the l?ivariate correlation between y and y 
(Fox, 1979) to assess forecast accuracy should be used with, 
caution. With OLS regression, this bivariate correlation is 
.the same as the multiple correlation, but this equivalence does 
not generalize to Rp^. The difficulty with the correlation be- 

A 

tween y and y is that it presumes that two regression constants, 
in addition to the parameters of the model, are utilized to make 
the predictions. Thus, systematic error in the forecasts could 

easily be masked by the correlation between y and y. It is 

i ^ 2 

tj^oretically possible that the Rp^ is negative even when the 

bivariate correlation between y and y is high. If this fact is 
recognized, however, the correlation might be used in conjunction 
with Rp^ to assess the degree to which forecasting errors are 

systematic. , 

The distinction bewteen R-square (from OLS) and Rp^-square 

'calculated to assess the accuracy of .forecasts is of fundamental 
importance in the assessment of theory. ""St^square assesses ac- 
curacy derived from a model for which the parameters are deter- 
mined post facto according to the explicit criterion of max- 
imijfeLng R-square. R-square cannot be calculated until after the 
dependent variable is observed and incorporated into calculation 
of the regression coefficients. In contrast, Rp^-square cal- 
culated from forecasts assesses prediction in the strict meaning 
of the term, because the forecasts are made prior to observing 
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the endogenous variables at the time point for which predictions 
are targeted. The values of the dependent variable at the tar- 
get date of the prediction do not enter into calculation , of thb 
prediction equation. One may conclude, therefore, that fore- 
casts comprise a much stronger test of theory than regression 
studies. This interpretation^ is reflected by the range of R- 
quare due to OLS compared to the range of Pp^-square from fore- 
casts. ■ , r- 

It should also be noted that most of these comments a^so 
apply to cross-validation studies. In fact, the roost convincing 
evidence in support of theory may be derived from 'cross- - 
validation studies in which parameteri|. of a differential equa- 
tion model (or analogous dynamic model) are estiroat^pd from one 
data, set '.anii-farecasts-^re_-assessed^nr a--di^f^erent data set — 
one collected independently from the data used to estimate the • 
parameters: Certainly, little is gained by splitting at random 
a single sample, since discrepancies between the ^ two Jialves (or 
several parts) must be due to^ sampling error alone and, there- 
fore do not test the robustness of the model under variations 
due to the many detailed ways in which data-collection pro- 
cedures may vary among data sets. In most cases, knowledge of 
sampling errpr is clearly stated in theoretical statistics; 
h'fence, little can be gained by a few observations of sampling 
error in a particular sample. 
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\CHAPTER 6 
SUM^RY COMMENTARY 



This volume is about the application of differential 
, equations to the development of career expectations. The start- 
ing point for the substantive ^'work is' path modeling of status 
. attainment as contained in the -efopio logy literature. The <. 
differential-equation model of career expectations may be viewed 
as a revision of path s^els of the same process. While status- ^ 
attainment r^earch in sojbiology provides, the^ theoretical frame- 
work, impb;ctant theoretical insights from' other research tara- 
ditions are referenced; ik part to justify the differential _ 
, equation model. More eplectic merging of ideas from a variety 
^f spurges^is-aeeded tnan is pres^xtted in this wo rk^ however. 

The mater i'al in the" preceding ."^apterV divides - conveniently 
into two main subtopics; (a), interpfi^tatibris o% the application 
\Qf differential equations to the theory of career expectations, 
and' (b) technical explication of. a particular ,^if ferential-equatioi| 
model of career expectations.^^ ^ , , . • 

Interpretation and Theory 

^ Chapter 2' presents ^justification for conceptualizing the 
process of developing career expectations as a simultaneous Hn- 
ear differential equation system i An example is presented whrch 
contains a set of two "exogenous" variabl.es, p^reiyfal status,! 
and measured m^tal ability, ^jid- a ^et of five "eridogenous" 
variables: school^grades , significant-other educational ^x- ./ 
pectation of ego, significant other occupational expectation Af 
ego, ego's educational expectation for self, and ego's occu^^ 
tional expectation for self. The rates of chaise over time in . 
the endogenous variables are 'hypothesized to be linear? functions 
of the exogenous variables and of -the current level of all the 
endogenous variables. The linear form of the model is viewed 
as an initial hypo-thesis, possibly to be modified later on the 
basis of theory and/or empiridal evidence. 

Three advantages accompany application of ttyT'dif f erential- 
equa*:ion s/stems to describe developing career expectations. 

, First, the continuous dynamic character of the development of 
C£u:eer expectations is Expressed by the differential equations; 
this feature of the dif ferentieC. equations is not shared ^y al-- 

. ternative models of career expecttations. Secondly, alsl {K)ssible 
two-directional effects among thdf endogenous variables are in- 
cluded in the differential-equation model. While two-directional 
effects are not a unique feature of differential equations, cur- 
rent path models of career expectations geneally omit two- 
directional effeicts.. Thirdly, sinc^ foreca'st^ to any point along 
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a tc^tinuous time scale are a natural Aspect of the dif ferenti^l- 
equ^ition model, stronof tests of the model are encouraged. Ob- 
servation of the ^xogenotis variables on one occasion and each 
endogen'ous variable on t^wo different occasions for each case . 
provides enough data to estimate" the. parameters of the model. ' 
The estimated parameters can then be used to forecast a value 
for each endogenous variable and each person at any point in time. 
The accuracy of these forecasts offers a difficult test §ot the 
theory to pass, a test not available by observation of multiple 
cortelations in a regression study. >. v 

The importance of using forecasts as tests &i theory ex- 
tends beyond the study pf career expectations. Moist social- 
science research is nonexperimental , that is, social scientists 
study natural systems of variables with little or no ability to 
manipulate important variables. Further, measurements of key 
'concepts in social-science research often stifain credibility. 
Thus, the barriers to scientific st\idy of social phenomena are 
high. Consequently, credibility of findings' from social re- 
search (demands the strongest empirical tests that can be mustered. 

while the linear differential-equation model of career ex- 
pectations offers advantages over current models, there are im- 
portant limitations of the dif f erential'-equation model which 
should be explicit. First, the differential equation model 
presumes continuous functions of time; hence, it is not readily 
generalizable to status attainments (e.g., occupation, educa- 
tion, income), because attainment variables exhibit abrupt 
shifts at isolated time points rather than continuous change. 
It is possible that career expectations manifest abrupt changes 
as well, although an apparently sudden change in expectation 
may be viewed as a continuous curve with short-radius turns. ^ 
It is po'ssible that abrupt changes can be modeled by a relatively 
new mathematical method called "catastrophe theory" (Zeeman, 
1977) . 

A second limitation of the dif ferential -equation model pre- 
sented here is that it assumes a linear form with the linear co- 
efficients constant over time. This assumptfeion can be relaxed 
should experience warrant, but the technical features of esti-^ 
mating a general nonlinear model or a linear model with non- 
constant coefficients have not bgen presented in the literature. 
A third limitation of the model iss-that it probably is' far too 
simplistic to capture even a reasonable approximation of the 
complexities of forming career, expectations. For example, the 
role of .uncertainty in forming career expectatidns is ignored 
altogether in the mode;. Adding realistic t:omplexities to the 
model- while preserving some semblance of parsimony should be a 
primary aspect of the research agenda in the coming decade. 
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Chapter 5 addresses four aspects of interpreting linear 
differential equation systems in conjunction with the process 
of forming career expectations. First, it is noted that .the 
"change coefficients" associated with the differential equations 
provide useful basis for interpreting effects of variables on> 
each other. The change coefficients express the basic theory. 
Further, they exhibit the virtue of being constant with respect 
to the length of the time interval between measurements. In. 
contrast, the cross-lagged path-coefficients (or regression co- 
efficients) depend in -a complex way on the. length of the time 
interval between measurements. Secondly, the structure of the 
matrix (B)t of change coefficients associated with the endogenous 
variables determines the time-path of developing career expec- 
tations. If the largest re^l part of the characteristic roots 
(Xs) of B is positive, career expectations increase exponential- 
ly. If the largest real part of the Xs is zero or negative, and 
B is full rank, career expectations seek an equilibrium that de- 
pendp on the background (exogenous) variables. If one or more' 
of the Xs is a complex number, career expectations oscillate 
over time; the amplitude of the oscillations expand, remain con- 
stant, or dampen over time, depending on whether the largest 
real part of the Xs is positive, zero, or negative, respectively. 
If all Xs are real, no oscillation occurs. 

Chapter 5 also develops a standardization methodology ''for 
differential equation' models that parallels standardizati-on in 
path analysis. It should be noted that standardized path co- 
efficients should not be used in the analysis because they - \ 
artifically remove changes over time in means and variances from 
the data. The standardization methods developed in Chapter 5 
retain changes in means and variances and fascilitate comparisons 
among coefficients .associated with variables measured on dif- 
ferent scales. One should be cautioned, however not to over- 
interpret these coefficients. There chief value is heuristic. 

The final topic in chapter 5 is assessment of forecast 
(prediction) accuracy. Several _proporjtional-reduction-in error 
measures of strength of relationship are discussed. One par-, 
ticular measure involving the ratio of the mean-square error 
due to forecasts to the cross-sectional variance represents a 
generalization of the square of the multiple correlation (R- ^ 
square) ? hence, it is read?.ly comparable to R-squares. It 
is noted in the chapter that the generalize(^ coefficient has no 
lower bound; it can a'ssume negative valufes. This fact is not 
grounds for rejecting the ipea sure, however, since negative 
values indicate useful infoi?ntation, • viz, th^t the Wross-sectional • 
mean of the dependent variable is a more accurate estimate of all 
the vdlues of the dependent variable than- are the forecasts. 
It i^ concluded that the negative range of the generalized cor- 
relation reflects the fact -that empirical tests based on fore- 
casts offer more stringent tests of theory than do regression 
studies. , . 
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TecjinicaX Information 

'"^In addition to the 4?nterpretative contents of this volume, . 
it .also contains the technical information needed to apply lin- 
ear dififetential equation systems with constant coeffiiients to 
tlie study of developing (cateer expectations. Chapter 3* sum- 
marizes important concepts .and theorems from mathematics that ^ 
^re'used in the rest of the monograph. ' Coverage includes basic 
ideas in calculus, concepts and operations related to complfex ^. 
numbers, and aspects of linear algebra. The treatment iis ex- 
tremely brief and intuitive, the purpose being to summarize Key 
concepts and theorems in a manner that is comprehensible to 
readers with little net hematical training. Chapter 4 pr,esents 
development of the theory of linear simultaneous equation systems 
with constant coefficients, proposes a justification fbr viewing 
the statistical estimation 'of the model as a "radttcea-fprm , 
system, for which -ordinary le^st squares'" (OLS) axe appropriate, 
and describes use of a computer program . which is available to 
carry out the calculations^ In the program the exogenous var- 
iables and the time-Jsero endogenous variables are the predete?- 
mijied variables and comprise the set of regressors. The tiufe- 
one endogenous variables. comprise the^ .set of dependent vari- 
ables. One OLS regression is calculated for each dependent 
variable using the same-* set of regressors for each' regression 
equation. The resulting regression coefficients are inpvjts to 
the. calculation of the parameters of the differential-equation 
model. 

- - Commentary . 

t 

A large part of the contents of this volume apply .to " topics 
other than development of career expectations. Any system of 
variables for which' a linear differential equation systehi sup— 
plies a good initial hypothesis can be "studied in the manner 
suggested in this volume. Examples iijclude development of 
political preferences, racial prejudice, and -prediction of mi- 
gration. If two panels of data are available on a system of 
variables', the estimation techinique suggested in this volume 
may be applied. 

The potential contribution of thi^ory .testing based on fore- 
casts has been emphasized. It seems likely, however, that at- 
tempts to verify theory u^ing even short-term forecasts will 
fail to yield convincing support of theory. Hopefully, such 
failures will stimulate imaginative revision of theory in which 
more nuances of the processes under .study are incorporated into 
formal theoretical statements expressed in dynamifc form. 
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